Gene Francci3_3101 details


Gene Information

Locus tag: Francci3_3101
Symbol:
ID: 3904227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3673327
End bp: 3674622
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 75%
IMG OID: 637880422
Product: hypothetical protein
Protein accession: YP_482187
Protein GI: 86741787
COG category:
COG ID:
TIGRFAM ID:
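
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the gene record.
start_bp = 3673327
end_bp = 3674622

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1296
print(protein_length)  # 431
```

Both results agree with the Gene Length (1296 bp) and Protein Length (431 aa) fields.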


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.218245
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.651768
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGACC AGAACCCGCC TCACCAGCCC CCCAGCGGTG ACGGGAGCCA GGAGGCGTGG 
AGGCAGCCGA GTGATCCCTG GCACCAGCCC GCGGACGCCG GCAGCACCCC CACCTCCGGT
TCGTCCTCTA CTCCCCCGGC GGCCCCGGTC CCATCGTGGG GAGCGCCCCA GCCCGGCCAG
CCGAACCAGC CCGACTCGGA GGAGACACGC CTCGCCTGGC CAGGCACCGC CCCACAACCG
GGTGGGCAGC AGTGGGGCGG GCAGCCCGGC GGCCCGCAGC ACCCCGGCGC CCCACAGCAC
CCCGGCGCCC CACAGCACCC CGGCGCCCCA CAGCACCCCG GCGCCCCACA GCACCCCGGC
GCCCCACAGC ACCCCGGCGC CGCGCCAGGG TGGGGACAGC CCGGCGAACG GCAGGGCTGG
GGACAACCGG GCGGCTGGCA ACAGCCGGAC CATCCCCCGG CCGCGGCTGC CGGGCCCTAC
CAGGGTGGGA GTGCCCCATA CCCGGGGACG GTCGGCTACC AGGGCGCTCC CGACGCCTAC
CAGAGCCCCC CGGGGGCTTA CCACAACCTC CCGGGGGGCT ACCCGCCGCC CGGCGGCTGG
CAGCAGGGCA GCCCGCTGCC TCCGCCCCGG CAGCGGGCCA ATCCGCTGTT CATCATCGTG
CCCCTCGCCG TGGTCGCCGC CATCGTGCTC GGTGTCGTGA TCGCCCTCGC GGTGCGCGGC
GGTGACGGCA CCGAGATCGC CTCCCCGCCG GCCCCGACGG TGACCGTGCC CGACCTCCCG
AACACCACTG CGACCGCGAC CGCGACCGCC CCGACCGCGG CCGCCCCGAC CGCGACCGCC
CCGAACGGAG TGCCGAACTG CGTTCCGGTG ACGCCACAGC ACGCGCCGGC GGCCGGCACC
GCGACGATCG GCGGGTCCGG CACCGCCGTC GGCCGGGCCA CCTCCTCGGT CAGCGACTTC
GAAGCCAGGG TCACCCTCAA CAGCGTCTGC TCCACCACCG GCAAGGTGAC CCAGTACGGC
GACGCGCCCA CCCAGGGGGC CTACTACGTC ATTAACGTGA CGGTCGAGGT GAGCCGCGGG
TCGACGTCCG CCTCGCCGTC CGACTTCTAC ATCCAGACCT CCGACGGCAC CCGGTACGAC
GGCAGCTACG AAAACGTCGC GCCACGGCTG TCGGTGTCGA CTCTCAAGGC CGGGCAACGG
GTCCGGGGCA ACATCGTCAT CGATGCGCCG CCCGGGCACG CCATCCTGAA CTGGGAACCG
CTGTTCGCCG TCGATCCGCC GAAGTTCCAG CTCTGA
 
Protein sequence
MTDQNPPHQP PSGDGSQEAW RQPSDPWHQP ADAGSTPTSG SSSTPPAAPV PSWGAPQPGQ 
PNQPDSEETR LAWPGTAPQP GGQQWGGQPG GPQHPGAPQH PGAPQHPGAP QHPGAPQHPG
APQHPGAAPG WGQPGERQGW GQPGGWQQPD HPPAAAAGPY QGGSAPYPGT VGYQGAPDAY
QSPPGAYHNL PGGYPPPGGW QQGSPLPPPR QRANPLFIIV PLAVVAAIVL GVVIALAVRG
GDGTEIASPP APTVTVPDLP NTTATATATA PTAAAPTATA PNGVPNCVPV TPQHAPAAGT
ATIGGSGTAV GRATSSVSDF EARVTLNSVC STTGKVTQYG DAPTQGAYYV INVTVEVSRG
STSASPSDFY IQTSDGTRYD GSYENVAPRL SVSTLKAGQR VRGNIVIDAP PGHAILNWEP
LFAVDPPKFQ L
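
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that behaviour, not the pipeline used to produce this record: the function name `translate_cds`, the constants, and the choice to stop at the first stop codon are all assumptions made for illustration.

```python
# Codon table built from the usual TCAG ordering. Table 11 assigns the same
# amino acids as the standard code; it differs in the allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence (hypothetical helper, illustration only)."""
    # Remove the spaces and line breaks used to group the sequence in blocks.
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]

    protein = []
    for index, codon in enumerate(codons):
        # A recognised initiation codon in first position is read as Met.
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_block = "GTGACCGACC AGAACCCGCC TCACCAGCCC"
print(translate_cds(first_block))  # MTDQNPPHQP
```

The printed residues match the first block of the protein sequence in this record. Applying the same function to the last five codons, `AAGTTCCAGCTCTGA`, gives `KFQL`, which matches the end of the protein sequence, with TGA read as the stop codon.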