Gene Francci3_1209 details


Gene Information

Locus tag: Francci3_1209
Symbol:
ID: 3903563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1443249
End bp: 1444679
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 74%
IMG OID: 637878542
Product: folylpolyglutamate synthase-like
Protein accession: YP_480316
Protein GI: 86739916
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0285] Folylpolyglutamate synthase
TIGRFAM ID: [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase
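
The reported lengths can be cross-checked from the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not contribute to the protein. The variable names are not part of the record.

```python
# Values copied from the record above.
start_bp = 1443249
end_bp = 1444679

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1431
print(protein_length)  # 476
```

Both values agree with the Gene Length and Protein Length fields.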


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.503356
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.49053
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGGCG CTGCCGGGAC CGCCGGAACC ACCCATGCTC CTGATCACGA GCGGGAGGGT 
CCACGCGCGT TCGCGGCCGA GTACTCCCAA CGCCGCGACA TCCGCGCCCG GCCGGGCTTC
GACCGGCTGC GCCGTCTGCT GGATCTGCTC GGCCAGCCGC AGCGGGCCCT TCCGAGCATC
GTGGTGACCG GTTCGGTGGG GAAGAGCTCC ACGGTGTCCG CGATCACCGC ACTGCTGCAC
TCGTTCGGCA TCCGCGCGGG ATCGTTGAGT TCGCCGGGCT GGCTCGGGGC CGAGCAGCGG
GTGGGGTTGG CCGACGCCAC GATCGATTCC GACGTCCTCG GCCGCGCCTA CGCCGATATC
GCGCCGTACC TGCCGCTGAT CGAGGATGAC GGCCGGCTGA GCGCCTTCGA GCTGCTTGCC
GCCGTGGCCT TCGCCGCCTT CGCCGACGCC CCCGCCGACC TCGCCGTGAT CGAGTCCGGC
TGGGACGACG CCGGCGAGCT GAGCGCACTG CTCGACGCGC CGGTGTCGGT CGTCACCCCC
GTGGCGGTGG GCGAGTGGGA GCCGGCGGCG GGCCTGGACC GGATCACCAC GCGGACGGCG
GGAGTCATCC GCGCCGACAC CCTGGTCGTG CTGGCCCAGC AGGTCCTGCC CGCCGCGCAG
GCGCTGCTGC GGCGCGCCGC CGAGACCGGG GCGATGGTCG CCCGGGAGGG GCTGGAGTTC
GGTGTACTCG GCCGGCAGGT GGCCGTCGGG GGGCAGATGA TCACGCTGAA GGGGCTGGGC
GGCACCTACG AGGAGATCTT CCTGCCGCTG CACGGCGCGC ATCAGGCCCA TAACGCGGCG
GTCGCGCTGG CCGCGGTCGA GGCGTTCCTC GGCGGCGGGC GCAACCAACT CGACGTCGAT
GCGGTCCGCG CCGGTTTTGC CGAGGTGGAC TCCCGGGGCC GGCTGGAGGT CGTCCGCCGG
TCCCCGACGA TCATTCTTGA CGTGGCCGGC TCGCCTGTGG CGGCGGGCGC GCTGGCCGCG
GCGCTCGACG AGGCGTTCAC CTTCGACGTC CTTGTCGGGG TCATCGCCAT CGAGCGCGAG
GTCGAGAGTG CCGCCGCTCG GGACGGTGGT GGCGAGCGGG ACGGTGCCGC CGAGCGGGAC
GGTGCCGCCG AGCGGGAGGC AGCGGAGCTC CTGGCCGTCC TGGAGCCGGT CCTGAACACG
GTGGTGGTCA CCGCCGGTGC TGGGCCCGGA TGGCTGCCGG TCGACGGTCT CGCCGCGGTC
GCCGTGGGTG TCTTCGGCTC CGACCGGGTG GAGGTCGTAC CCCGGTTGGA CGACGCGCTC
GACGCCGGGG TGCGCCTCGC CGAGGAGGAC GCCGATCTGG GTGGCGCCGG CGTGGTGGTC
ACCGGTTCGG CCTCCGTCGT GGGTCGGGCC CGGGCGTTGC TGCGTCCGTG A
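
A GC content figure like the one in the Gene Information section can be recomputed from the sequence. This is a minimal sketch, not the database's own method: `gc_fraction` is a hypothetical helper, and only the first row of the gene sequence is used, to keep the example short.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First row of the gene sequence above, used only as a short demonstration.
first_row = "GTGACAGGCG CTGCCGGGAC CGCCGGAACC ACCCATGCTC CTGATCACGA GCGGGAGGGT"
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full 1431 bp sequence instead of `first_row` gives the figure for the whole gene.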
 
Protein sequence
MTGAAGTAGT THAPDHEREG PRAFAAEYSQ RRDIRARPGF DRLRRLLDLL GQPQRALPSI 
VVTGSVGKSS TVSAITALLH SFGIRAGSLS SPGWLGAEQR VGLADATIDS DVLGRAYADI
APYLPLIEDD GRLSAFELLA AVAFAAFADA PADLAVIESG WDDAGELSAL LDAPVSVVTP
VAVGEWEPAA GLDRITTRTA GVIRADTLVV LAQQVLPAAQ ALLRRAAETG AMVAREGLEF
GVLGRQVAVG GQMITLKGLG GTYEEIFLPL HGAHQAHNAA VALAAVEAFL GGGRNQLDVD
AVRAGFAEVD SRGRLEVVRR SPTIILDVAG SPVAAGALAA ALDEAFTFDV LVGVIAIERE
VESAAARDGG GERDGAAERD GAAEREAAEL LAVLEPVLNT VVVTAGAGPG WLPVDGLAAV
AVGVFGSDRV EVVPRLDDAL DAGVRLAEED ADLGGAGVVV TGSASVVGRA RALLRP
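
The protein sequence follows from the gene sequence under translation table 11. The sketch below is written under two assumptions: the codon table is built from the standard NCBI amino-acid ordering, and the start-codon set is the one commonly listed for table 11. `translate` is a hypothetical helper, and only the first row of the gene sequence is translated.

```python
from itertools import product

# Standard genetic code in NCBI order (first base varies slowest, order TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: alternative start codons accepted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "GTGACAGGCG CTGCCGGGAC CGCCGGAACC ACCCATGCTC CTGATCACGA GCGGGAGGGT"
print(translate(first_row))  # MTGAAGTAGTTHAPDHEREG
```

The gene starts with GTG, which would encode valine inside a reading frame but is read as methionine in first position. This is why the protein begins with M.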