Gene Francci3_2656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2656 
Symbol 
ID3906329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3135839 
End bp3137425 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content73% 
IMG OID637879981 
Product2-methylcitrate dehydratase 
Protein accessionYP_481747 
Protein GI86741347 
COG category[R] General function prediction only 
COG ID[COG2079] Uncharacterized protein involved in propionate catabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.298915 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.184722 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTGC ACGAGGTGCG CGCCCGCCGG TCCGTCGAGG CGCCGCCCCG GCGTGACCAG 
CTTGCCTGGA AGATCGCCGA GGTCGCCGCC GAGCGGGTGC CGGTGCCGCC GGAGGTCGTC
GAGATGATCG GCAACCGAAT CATCGACAAC GCGGCGGTCG CCGCGGCGGC GCTGACCCGC
GGCCCGGTCG TCGCGGCCCG CGACCAGGCG CTCGCCCATC CGTACACCCC GGGAGCGACG
GTCGTCGGCG TCGAGCGGGC AGTACGGGTG TCGCCGGAGT GGACGGCCTG GGCGAACGGC
GTCGCCGTGC GGGAACTGGA CTTCCACGAC ACCTACCTGG CCGCGGACTA CTCCCATCCG
GGCGACAACA TCCCCCCGGT GCTCGCCGTC GCGCAGCACA CCGGCCGCGG CGGCGCCGAG
CTGGTCCACG GGATCGCCAC CGCGTACGGG ATCCAGGTCG ACCTGGTGAC CGGCATCTGC
CTGCACGAAC ACCGCATCGA CCACATCGCC CACCTGGGCC CCTCCGCCGT CGCCGGGATC
GGTGCCCTGC TCGGCCTGCC CCCCGAGCCG ATCTACCAGG CCGTCGGCCA GGCGTTGCAC
ACGACGACGA CCACCCGTCA GGCCCGCAAG GGCGAGATCT CCACCTGGAA GGCGTACGCC
CCGGCCTTCG CCGGCAAGAC GGCGGTGGAG GCGGTCGACC GGGCGATGCG CGGCCAGACC
TCGCCGGCGC CGATCTACGA GGGGGAGGAC GGGGTCGTCG CCTGGCTGCT CGGCGGCCCG
GACGCGGTGT ACCGGGTGGC GCTGCCGGAA CCCGGCGAGC CCCGTCGGGG CATCCTCGCC
ACCTATCCCA AGGAGCACTC CGCCGAGTAC CAGAGCCAGG CGCTGATCGA CCTGGCCCGC
CGGCTGCGTA CCCGGCTGCC CGGGACCGGC TCGTCTCTCG GGGCCGGCTC GTCTCTCGGG
GCCGGCTCGT CCACGGCCTC CTCCGAAGCC TCCTTCGAGG TCGACGTCGC GGCGATCCGC
CAGATCGTCA TCCACACCAG CCATCACACG CATCACGTGA TCGGCACCGG CGCCGGGGAT
CCGCAGAAGG CCGATCCGAC CGCGAGCCGG GAGACGCTCG ACCATTCGAT CATGTATATC
TTCGCGGTGG CGCTGCAGGA CGGGACGTGG CACCACGAAC GTTCCTACGC CCCCGAACGG
GCCGCCCGGC CCGACACCGT CGCGCTGTGG CACCGGATCC GCACGGTGGA GGATCCGCAG
TGGACCCGCC GTTACCACGC GACCGACCCG GCCGAGCGGG CCTTCGGCGG GCGGGTGGAG
GTCACGCTCG TTGACGGGAC GTCGATCGTG GACGAGATCG CCGTCGCCGA CGCGCATCCG
GCCGGGGCCC GGCCGTTCCG GCGTGCGGAC TACGTCGCCA AGCTGCGTAT GCTCGCCGAG
GGGGTCGTGT CGGCCGCCGA ACAGGACCGG TTCCTCGACC TGGTCGGCCG GCTCGACACC
CTGACCCCGG CCGAGCTCGC CGGGCTGACC CTCGTCGCCG ACGCGCTCGC CCTGGAAACG
GGCGGGACGA GGGGGGTCTT CGCATGA
 
Protein sequence
MKLHEVRARR SVEAPPRRDQ LAWKIAEVAA ERVPVPPEVV EMIGNRIIDN AAVAAAALTR 
GPVVAARDQA LAHPYTPGAT VVGVERAVRV SPEWTAWANG VAVRELDFHD TYLAADYSHP
GDNIPPVLAV AQHTGRGGAE LVHGIATAYG IQVDLVTGIC LHEHRIDHIA HLGPSAVAGI
GALLGLPPEP IYQAVGQALH TTTTTRQARK GEISTWKAYA PAFAGKTAVE AVDRAMRGQT
SPAPIYEGED GVVAWLLGGP DAVYRVALPE PGEPRRGILA TYPKEHSAEY QSQALIDLAR
RLRTRLPGTG SSLGAGSSLG AGSSTASSEA SFEVDVAAIR QIVIHTSHHT HHVIGTGAGD
PQKADPTASR ETLDHSIMYI FAVALQDGTW HHERSYAPER AARPDTVALW HRIRTVEDPQ
WTRRYHATDP AERAFGGRVE VTLVDGTSIV DEIAVADAHP AGARPFRRAD YVAKLRMLAE
GVVSAAEQDR FLDLVGRLDT LTPAELAGLT LVADALALET GGTRGVFA