Gene Francci3_3252 details


Gene Information

Locus tag: Francci3_3252
Symbol:
ID: 3904423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3850784
End bp: 3852388
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 73%
IMG OID: 637880577
Product: hypothetical protein
Protein accession: YP_482338
Protein GI: 86741938
COG category: [S] Function unknown
COG ID: [COG2187] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
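
The coordinate, gene length, and protein length fields above are mutually consistent, which the following minimal sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are inferred from the numbers in this record rather than stated by it. The function names are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3850784, 3852388)
print(length)                  # 1605
print(protein_length(length))  # 534
```

Both values match the Gene Length and Protein Length fields listed in this record.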


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGACC CGAGCATGCT GCCGCGGCCG CCGGGACGCG GGCAGGCTCG TGCCATGAGT 
CTGGAGACCA CGGTGAACCC GCTCCCGGCG GAGCGAGGCG CCTCGGCCTC GGCGCCAGCG
GTGGCGCCGA CGGTGGCGTC GGGGCCTCCT GCCAGGGTCG TCGAGACGGC GCGGTCCGTG
CTCGTCTTCC TTGGAGATCG TGTCTTCAAG GTCAAGAAGC CGGTCGACCT CGGTGCCGTG
GATTTCCGCG GCAGGCAGGC GCGGTTGGCC GCCTGCGAGG CCGAGGTGAG ACTCAATCGT
CGCCTCGCGC CCGACGTCTA CCTCGGCGTC GCCGATGTCA TCGGACCGGA CGGGGAGCCC
TGCGACCACA TGGTCGTGAT GCGGCGACTA CCCGAGGCGC GCCGGCTGTC GACGCTTGCC
GAAGGTGGCA CCGAGGTCAG GGCGGAGATC CACGCCCTCA CCCGGGTGCT GGTCGACTTC
CACGCCCGGT GTGAGACCTC GTCCCGGATC GCCGAGGCAG GTGGCCTGGA CCGTCTGCGT
GGGCGGTGGG ACGCCTGCTT CGCCCGGGTA CAGCGTGACC ATGGCGCGGC GGTGAGCGCC
AGCATCCTCG ACCATGTGAA CCGTCTCGCC GTGCGGTACC TCGACGGCCG CGACGAGCTC
CTGCGGGAGC GGCGCGAGGC CGGGCGGATC CGCGACGGGC ACGGCGATCT GTCCGCCGCG
GACATCTTCT GCCTCGACGA CGGGCCCAGG GTGCTCGACT GCCTGGAGTT CGAACCCGGG
CTGCGGGCGG CCGACGTCCT CGCGGACGCC TGCGCCCTGG CAGCCGACCT CGAGTGGCTC
GGACGCCGCG ATCTCGCCCG GCTCTTCCTC GATCACTACC GTGAGATGGC TGGTGAGACC
CATCCTCGGT CGCTCGAGGA TTTCTACTGG GCGCTGGCCG CGCTGGGGCG CTGCCAGGCG
GCGTGCCAGC GTGTCGCGGC CGGCGAGAAC GCGGCGGCGG AGGCGCGGGC CTTCGCTGAC
CTGGCACTGG CCCGGTTGCG CTGGGGCCGG GTCCGACTCG TGCTGGTCGG CGGCCAGCGC
GGCACCGGGA AGTCCACGCT CGCCGGCGGG CTCGCCGGCA CGGAGCGGTG GACCGTGCTC
CGCTTCGACG ACGCCGCGGC GGACCTGGCG GCCTCGGCCA ACCGTCACGA TCTCGCGGCG
GGGGGATGGG CAGATGCAGG GGGATGGGTA CCGGCCGACG ACGTCGACGC GGTCCACCAG
GAACTGCTGC GCCAGGCCGG CACAGCGCTG CGCCGCGGCG AGTCCGTCGT GGTCGACGCA
CCGTGGAACC GGCACAGCCA GCGCGCGCAA GCGGCCGATG TCGCTCGCCG TGCCTTCGCG
GACCTGGTGC AACTGCGCTG CACGGCCCCG CCCGATCTCG CGGCAACCCG TACCGACCGG
CGTTCCCCGG CAACCACCGC GGCAACCAGC GCCACAGGGT CCGTCGGCCT TGGCCGTCTC
GCCGACACCG TCTCCCGGAT CGAACCCTGG CCGGAAGCCA AGATCATCGA TACGGCGGTG
GCCATCGCCG AGTCGCTGCA CAACGCCCGC CGCGCCGCGG CCTGA
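
The gene sequence above is laid out in space-separated blocks of 10 bases, 60 per row. As a rough sketch of how such a block could be turned back into a contiguous string and how a GC fraction like the one reported in this record could be computed, the hypothetical helpers below strip whitespace and count G and C bases. This is not necessarily how the database derives its GC content value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied from this record.
first_row = "ATGCACGACC CGAGCATGCT GCCGCGGCCG CCGGGACGCG GGCAGGCTCG TGCCATGAGT"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence through `clean_sequence` before `gc_fraction` would give the GC fraction for the whole gene.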
 
Protein sequence
MHDPSMLPRP PGRGQARAMS LETTVNPLPA ERGASASAPA VAPTVASGPP ARVVETARSV 
LVFLGDRVFK VKKPVDLGAV DFRGRQARLA ACEAEVRLNR RLAPDVYLGV ADVIGPDGEP
CDHMVVMRRL PEARRLSTLA EGGTEVRAEI HALTRVLVDF HARCETSSRI AEAGGLDRLR
GRWDACFARV QRDHGAAVSA SILDHVNRLA VRYLDGRDEL LRERREAGRI RDGHGDLSAA
DIFCLDDGPR VLDCLEFEPG LRAADVLADA CALAADLEWL GRRDLARLFL DHYREMAGET
HPRSLEDFYW ALAALGRCQA ACQRVAAGEN AAAEARAFAD LALARLRWGR VRLVLVGGQR
GTGKSTLAGG LAGTERWTVL RFDDAAADLA ASANRHDLAA GGWADAGGWV PADDVDAVHQ
ELLRQAGTAL RRGESVVVDA PWNRHSQRAQ AADVARRAFA DLVQLRCTAP PDLAATRTDR
RSPATTAATS ATGSVGLGRL ADTVSRIEPW PEAKIIDTAV AIAESLHNAR RAAA