Gene Francci3_0207 details


Gene Information

Locus tag: Francci3_0207
Symbol:
ID: 3905374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 245245
End bp: 246375
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 69%
IMG OID: 637877536
Product: hypothetical protein
Protein accession: YP_479325
Protein GI: 86738925
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4124] Beta-mannanase
TIGRFAM ID:
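
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(245245, 246375)
print(length, protein_length(length))  # prints: 1131 376
```

Both values agree with the Gene Length and Protein Length fields of this record.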


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0771524
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTGGTT TTCGTGGTAC CGGAGCGCCA GGAAGGACCG AGGGACCCGA TGGAACGTGC 
AGCGTGACGG CCCGGCAGCG AGGTCGGGCC GGGTACCGGG CTCGCCCGGG CAGGCCGTTA
TGGATGCCGC TGACCGCACT GACGATGGTG ATGGCTGTTG TGCTGGGCGT GGCGGCCTGC
TCGGATCAGC CCGAGCGTCC CCGCGCTCCG CGGCCGTCAC CAACCACCAC GCCCCCCTCC
GCCGTGGAGC CGGCACCGTC CGGGCTGGGC TGGGTATCCG GCGCCAATGG AAACTTCCCG
GCAGACGTGA CCGCGTGGGC GGCGTGGACC CGCCGTCCCG TCGACCTCGC GATCGTTTTC
ACGGACCGCG CCAACTGGCC GAGCATCACC ACGGCGTCCT GGCCGGTCGG GGCGTTCACC
CGGGCAGCCT TTCCCGGGGA GCTGTCGGTG GCGCAACCGC TGTACCCGCA GAACGGCAAT
GAACAGGCGT GCGCTCGCGG TGAGTACGAC GGCTACTGGG CCCAGTTCGG GCAGACTCTG
TCGAAGTACG GCCGCGGCGA CGCCTACGTG CGCCTCGGCT GGGAGTTCAA CGGGGACTGG
TTCTGGTGGC ACGTCCGGGA TCCGCAGGCG TGGAAAAGCT GCTTCCAACA CGCCGCCACC
GCGATCCGGT CGACCGCTCC GCACGTGAAG ATCGACTGGA ACATGACCGC GCACCGCGAC
AGCCTGCCCG GCAGCGGCGC GGACGTCTGG TCCGCCTACC CCGGCGACGC CTACGTCGAC
GTCGTCAGCA TCGACTCCTA CGACTCCTAT CCGGCGTCCA CGACGGAGCA GGTCTGGACC
CGGCAATGCC AGCAGCGTTC CGGGCTGTGC ACCGTCGCCG CCTTCGCCCG CGCCCACGGC
AAGCGGTTCG CCGTCCCCGA ATGGGGACTG GTGCGCTCGA CCGGCGGTGG CGGCGACAAC
CCGTTCTACA TCGAGAAGAT GCACGAGTTC TTCGCGGCGA ACGCCGGACT CCTCGCCTAC
GAGGCCTACT ACAACAACGC GGAAGCTGAC AACGTGCAAT CCTCCCTGCA CAACCCGAAC
CTGAGCCCGA ACAGCTCCCG CCGCTATCTC GGCCTGTTCG GAAAGGACTG A
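
A GC content figure like the one reported for this gene could be recomputed from the sequence as printed, in blocks of ten bases. This is a minimal sketch that assumes GC content is defined as the fraction of G and C bases over all bases; the record does not state how its value was derived, and `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) used for block formatting is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Usage: paste the full gene sequence (all lines) into a string, then
# format the result as a percentage, for example:
#   print(f"{gc_fraction(gene_sequence):.0%}")
```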
 
Protein sequence
MVGFRGTGAP GRTEGPDGTC SVTARQRGRA GYRARPGRPL WMPLTALTMV MAVVLGVAAC 
SDQPERPRAP RPSPTTTPPS AVEPAPSGLG WVSGANGNFP ADVTAWAAWT RRPVDLAIVF
TDRANWPSIT TASWPVGAFT RAAFPGELSV AQPLYPQNGN EQACARGEYD GYWAQFGQTL
SKYGRGDAYV RLGWEFNGDW FWWHVRDPQA WKSCFQHAAT AIRSTAPHVK IDWNMTAHRD
SLPGSGADVW SAYPGDAYVD VVSIDSYDSY PASTTEQVWT RQCQQRSGLC TVAAFARAHG
KRFAVPEWGL VRSTGGGGDN PFYIEKMHEF FAANAGLLAY EAYYNNAEAD NVQSSLHNPN
LSPNSSRRYL GLFGKD