Gene Francci3_3226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3226 
Symbol 
ID3906193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3818643 
End bp3819974 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content68% 
IMG OID637880551 
Producthypothetical protein 
Protein accessionYP_482312 
Protein GI86741912 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.407522 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTCATG ATCCGTCCGC GTTCGTCATA CCGGCGGAAT TCTGGGATCG GACCCAGGTC 
GCCGAGAGCC TGCAGATGAG GGACATCGGT TCCCTCTTCC GGCTGGTACA GCGGTACGCC
GGGGCGAGTC AGCCAGGGAT CGGGATGCGC GTCGGCCTGG CCCAGTCAGA TATCAGCAAG
TACATCAACG GCAAGCGAAT CGCCGCCGAG TTCGAGTTGT TCGAGCGCGT CGCCGATGGG
CTCGACCTGC CCGACCGTGC CCGCATGTTG ATGGGGCTGG CACCGCGCGG CGCCGTCCAG
TCTCCGGAGG CCTCACGGAC GAACATCGTG GAACAGACCG CCCTTCCCGA GGAATCGGAT
TCCGTCGAGG AGATCGGGCA GCGGATCGAG ACGCTCGGTG CATCCAACGT CAGCACGGCC
GTTCTCGCCC ACTTCGACGT TCTGCTCCTG ACCATGGCGG ACGAGTACGA GTGGGCCGGC
CCGGAAAAAC TCGCACCACG GGTACTCAGG CAACGCCGCC GGGTTCAGAA CCTCCTGGAA
GGACGACAGC CGCCCCGGCA GCGTGAACGG CTCTACGAAA TCGCCGGTCG CCTCTCCGGA
ATACTCGGCT ACATGGCGGT GAACACCGGC CGGTTCGGAC TCGCGCGCGC CTACTGTCTG
GAGGCTCTCC ACACCGCTGA GCTGGTCGGT CACCACGACC TCACGGCATG GATAGGCGGC
ACGCAGAGCC TGTGTGAGTA CTACGCCGGG GACTACAGAG CTGCCCTGGA GTTCGCCCGG
GAGGGTCGCC GTGTCGGTGG CCGGTCGGCC CAGGTGATCC GCCTCGCTGT GAACGGGGAG
GCACGCGCCC TCGGTCGGCT CGGTGACCGC GCCGGGGTGG GCCGATCCGT GGGCGAGGCG
TTCGACCTCG CCGAACAGCA TCCGGTGCCC GACGGGATGT CGCCCTGCAT CTCCTTCGCG
CCGTACAGCA TCGCCCGTAT CGCCGCGAAC GCCGCGACCG CCTATGTCTC ACTGGGCGAG
CCCGGTCAGG CGCGGGAGTA TGCGGACATG GCCGCTCAGG TCGCAGACCG ATCCCCGTCG
ATGTGGAGCC GTTGCCTCGT CCGCCTCGAC CTCGCGACCG CGCTGCTGCT GTCCCACTCC
CCCGATCCGG AGCAGGCTGC CGTCCTGGGC ATCGAGGCTC TGACCGCCAC GGCCGGCAAC
CCGATCGAGT CCGTCCGACG TCGTAGCCAT GAGCTGGTCA CCCGCGCCAA GCCGTGGCAG
CAGATCGGCC CGGTCACCGA ACTCGCGGAG GCTTCCCGCG CGCTCGCCCT CCCCACCGGC
GCACACCGGT GA
 
Protein sequence
MRHDPSAFVI PAEFWDRTQV AESLQMRDIG SLFRLVQRYA GASQPGIGMR VGLAQSDISK 
YINGKRIAAE FELFERVADG LDLPDRARML MGLAPRGAVQ SPEASRTNIV EQTALPEESD
SVEEIGQRIE TLGASNVSTA VLAHFDVLLL TMADEYEWAG PEKLAPRVLR QRRRVQNLLE
GRQPPRQRER LYEIAGRLSG ILGYMAVNTG RFGLARAYCL EALHTAELVG HHDLTAWIGG
TQSLCEYYAG DYRAALEFAR EGRRVGGRSA QVIRLAVNGE ARALGRLGDR AGVGRSVGEA
FDLAEQHPVP DGMSPCISFA PYSIARIAAN AATAYVSLGE PGQAREYADM AAQVADRSPS
MWSRCLVRLD LATALLLSHS PDPEQAAVLG IEALTATAGN PIESVRRRSH ELVTRAKPWQ
QIGPVTELAE ASRALALPTG AHR