Gene Francci3_3370 details

Gene Information

Locus tag: Francci3_3370
Symbol:
ID: 3905952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3998881
End bp: 4000737
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 11
GC content: 70%
IMG OID: 637880693
Product: cobalamin B12-binding
Protein accession: YP_482454
Protein GI: 86742054
COG category: [C] Energy production and conversion
COG ID: [COG1032] Fe-S oxidoreductase
TIGRFAM ID:
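
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon; neither convention is stated in the record itself, and the variable names are made up for this example.

```python
# Values copied from the Gene Information record.
start_bp = 3998881
end_bp = 4000737
gene_length_bp = 1857
protein_length_aa = 618

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the coding sequence ends with one stop codon
# that is not counted in the protein length.
codons = span_bp // 3
expected_protein_length = codons - 1

print(span_bp)                  # 1857
print(span_bp % 3)              # 0
print(expected_protein_length)  # 618
```

Under these assumptions the span matches the listed gene length, is a whole number of codons, and gives the listed protein length.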


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.818313
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0333946
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTGGA CGCCCGCCGA GGGGGCCCCT CGGATACTTC TGATCAAACC GCCGATCCGG 
GCCTGCATGA TCGAGATCGG TCGACACATG CCGATCGGCC TGGCCTACCT GGCGTCCCGC
CTGCGTGCGG CGGGCATGCA GGTCGACATC TTCGACGCGC TGGCCAACAC CGAGGACAAC
CACGTGGTCG ACCCGGCCGA CTACACGGCG GCCGACCGTG CCAAGGTCGC CGCACATCCG
CGCTGGGGGC ACCTGGTCCA CTGGGGCGCG CGCTGGGACC GGCTGGAGCA GCAGCTACGC
CGCGGCTACG ACCTGGTCGG CGTGTCCTGC ATGTTCACCC CGTACTACGA GCCGGCCTAC
GAACTCGGTC GGCTCGCCAG GCGGATCCTG CCGCACGCCC GGGTCGTGCT CGGCGGTCAG
CACCCCACGG TCGCCTTCCA GCACGCCATG ACCGAGCCCG CCTTCGACGC GCTGGTGCTG
GGCGAGGCCG AGTCGAACGT GGTCGAGATC GTGCAGGCCC TGCTGGACGG AGCGCCCCTG
ACCGGGCTGG CCGGCGTGGC CTATCGCTGC GGCACCGGCC TGTGCGGGTG TCCGCCGGGC
GGCCCTGGCG TCCACGTGCA GCCGCGTACC GAGTTCCTCA CCGATCTGGA CGGCCTGCCC
CTGCCGGCGG TCAACCTGCT CGAGATGAGC AGCTACGACT CCACCGCGAC GCTGATCACC
AGTCGCGGCT GCCCCTTCTC GTGCTCCTTC TGCACGGTGC ACGCCACCGT GGGCAGGAAG
TTCCGGGCCC GCAGCCCGCA CAACGTCGCC GACGAGATCG AGCACTACGT GACCGACCAC
GGCATCCGCC GGTTCTTCGT CGAGGACGAC AACTTCACCT TCGACGTGCC CAGGGTGCGC
GAGATCTGCC GCGAGATCAC CCGCCGTGGG CTGGACGTCG AACTGCACCT GCCCAACGGG
ATGACCGTGG TCGACCTGGA CGGCTCCCTG GTGGACGACA TGGTGTCGGC CGGGTTCCGC
AGCCTGTTCC TGGGCCTGGA GACCACCGAC CCCGTCCGCC TGCGCCGCAT CCGCAAGGGT
TTCACCTCGC TGCAGAAGGT CAACGACGGT GCGGCCCTGT TCCGGGAACA GGGGGTGGAC
GTCGGCGCCT CGCTGATCGT CGGCCTGCTC GGCCAGGAGT GCAGCGAGCT CGCCCGCGAC
TCGGTCAACC TCATGCTCGC CGGGGTGCGT TTCTGGACCA ACCCCTTCTA CCCGATCCCC
GGCTCCCCCG ACTTCGAGCG CTGCCTGCGC ACCGGACTGA TCACCCCCAC CACCGAACTG
GCGCTGTACG ACCAGTTCAA CTTCGCCATC GGTTCGGACC GGCTCAGCCC GGCCGAGCTG
TACTGGGCCT GTGTGGTCAC CCAGGCGATG GCCCAGTGGA CCGGCTATCT CCTGGAGGGG
GCGCACGCGC GCCGCCGGGG AGCCGCGGCG ACCACCGAGC AGGCCCTGGA ACGGCTGGTC
CAGCACAGCG GTGACCTGTT CGGCCCGCAC GGCGAGCTCG AGGTCCCGGC GGTGCCGGTG
GCGGTCGACC GGCAGACCGT GCGCGTGCAT CCCGACGGGT GCTTCCACAG CATGCAGCGC
CTGGACCCGG CCCGGCTACC GGCGGACTCC TGCACCTTCA CCGGGGACGT CATCGCCGCG
GCCGTCACCC TCCACACGGG TGTCGCGCAC GGTGGGCGTC AGAACGCCGG GACCGGTTCC
CCCTGTGCTT TCACCATCAG CACGACCTTG GCCGATCCGG TCGGCGCCGC GGTGGCGGCC
ACCGTGCTCA CCGAACTGGA AACGGCGCTG ACAACGAACA TGGAGGTTCT CGGGTGA
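
The gene sequence is printed in space-separated groups of ten bases, so it has to be cleaned before its length or GC content can be computed. The following is a minimal sketch, not the database's own method. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Strip spaces and line breaks from a sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = clean_sequence(
    "ATGACCTGGA CGCCCGCCGA GGGGGCCCCT CGGATACTTC TGATCAAACC GCCGATCCGG "
)

print(len(first_line))  # 60
print(f"{gc_fraction(first_line):.0%}")
```

Pasting the full sequence block into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the GC content listed in the record.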
 
Protein sequence
MTWTPAEGAP RILLIKPPIR ACMIEIGRHM PIGLAYLASR LRAAGMQVDI FDALANTEDN 
HVVDPADYTA ADRAKVAAHP RWGHLVHWGA RWDRLEQQLR RGYDLVGVSC MFTPYYEPAY
ELGRLARRIL PHARVVLGGQ HPTVAFQHAM TEPAFDALVL GEAESNVVEI VQALLDGAPL
TGLAGVAYRC GTGLCGCPPG GPGVHVQPRT EFLTDLDGLP LPAVNLLEMS SYDSTATLIT
SRGCPFSCSF CTVHATVGRK FRARSPHNVA DEIEHYVTDH GIRRFFVEDD NFTFDVPRVR
EICREITRRG LDVELHLPNG MTVVDLDGSL VDDMVSAGFR SLFLGLETTD PVRLRRIRKG
FTSLQKVNDG AALFREQGVD VGASLIVGLL GQECSELARD SVNLMLAGVR FWTNPFYPIP
GSPDFERCLR TGLITPTTEL ALYDQFNFAI GSDRLSPAEL YWACVVTQAM AQWTGYLLEG
AHARRRGAAA TTEQALERLV QHSGDLFGPH GELEVPAVPV AVDRQTVRVH PDGCFHSMQR
LDPARLPADS CTFTGDVIAA AVTLHTGVAH GGRQNAGTGS PCAFTISTTL ADPVGAAVAA
TVLTELETAL TTNMEVLG
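
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is hedged. It builds the codon table by hand, using the amino acid assignments that translation table 11 shares with the standard genetic code. It does not handle the alternative start codons of table 11, which is acceptable here only because this gene begins with ATG. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in TCAG order. Translation table 11 uses the same
# codon-to-amino-acid assignments as the standard code ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(block: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Spaces and line breaks in the input are ignored. Alternative start
    codons are NOT converted to Met (a simplification of this sketch).
    """
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied as displayed.
print(translate_cds(
    "ATGACCTGGA CGCCCGCCGA GGGGGCCCCT CGGATACTTC TGATCAAACC GCCGATCCGG"
))  # MTWTPAEGAPRILLIKPPIR
```

The output for the first sequence line matches the first twenty residues of the protein sequence shown above.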