Gene Franean1_4198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4198 
Symbol 
ID5672553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4998404 
End bp4999621 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content78% 
IMG OID641243071 
ProductN-acetylglucosamine-6-phosphate deacetylase 
Protein accessionYP_001508488 
Protein GI158315980 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1820] N-acetylglucosamine-6-phosphate deacetylase 
TIGRFAM ID[TIGR00221] N-acetylglucosamine-6-phosphate deacetylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0589061 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.780011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGTTC TGGCCGGCGC CCGGGTGGTG ACGCCGCACG GTGTCCTCGA CCCCGGCCGG 
GTGCGCGTCG AGAACGGCCT GATCACCGAG GTCGGCACCG AGGCCGGCAC CGAGGTCGGG
CCCACGGCCG GGCCCGTCGG TGGGGAGGCG GGCGGCGAGG GGACGGGCGG CGAGGACATC
GTCGACCTGG GCGGGTCCTG GCTGGTGCCC GGGTTTGTCG ACCTGCACGT CCACGGCGGG
GGCGGGCACG ACGTCACGGC GTCACCGGCC GATCTGGCCG CGGCGGTGGC CTTCCACCGG
GCGCACGGCA CGACCCGCAC GCTGGTCTCG CTGGTGGCGG CGCCGGTGGA GCGCCTGGCC
GAGCAGTTGT CCTGGGTGGC GGCGCTCACC GCGACCGGGC CGGGGCCGGA CGGCCATGTG
GTCGGCGCGC ATCTGGAGGG GCCGTTCCTC GCGCCCGCGC GCCGTGGCGC CCAGCCGGGT
GAGCATCTGC GCGGGCCTGA CCGCGGTGTG TTCGCCGAGC TCGTCGCGGC GGGCGCGGGC
ACGCTGCGGG TGATCACCCT CGCCCCGGAG CTGCCCGGGG CCGGCGCGGT GACCGAGGCC
GCGCTCGCGG CGGGGGTGAT CGCCGCCGCC GGCCACACGG ACGCCACCTA CGACGAGGCC
GCCTCCGGTT TCGCGGCTGG CATGACGCTC GCCACCCACC TGTTCAACGG CATGCGCCCG
CTGCATCACC GCGAGCCCGG CCCGGCCGGG GCGGCGCTCG ACGCCGGTGT GGCCTGCGAA
CTGATCAACG ACGGTGTGCA CGTGCACCCG GCGCTGCTGC GCCTGGTCGC CGCCGAGCCG
GCGCGCCTGG TGCTGGTCAC CGACGCGGTC GACGCGGCGG GTGTCGGCGA CGGCGACTAC
CTGCTGGGCG GCCACCCGGT CCGGGTCCGG GACGGGCAGG CCCGCCTGGC CGCCACCGGC
GCGCTCGCCG GCAGCACCCT GACGATGGAC CTGGCGGTGC GCCGCGCCGT CGCGGCCGGG
CTTGCGCTCG AGGTGGCGGT CGCCGCCGCG GCGACGAACC CCGCCCGGGT GCTGGGCCTC
GCCCACCGCT GCGGGTCGAT CGCCCCCGGG CTGGACGCCG ACCTCGTCGT GCTCGACGCC
GATCTACGGG TCACGAGGGT CATGGCGGCC GGCAGGTGGG TCCCCGGTCC GGCTACCCGG
CCGATCGCGG CCGGGTAG
 
Protein sequence
MIVLAGARVV TPHGVLDPGR VRVENGLITE VGTEAGTEVG PTAGPVGGEA GGEGTGGEDI 
VDLGGSWLVP GFVDLHVHGG GGHDVTASPA DLAAAVAFHR AHGTTRTLVS LVAAPVERLA
EQLSWVAALT ATGPGPDGHV VGAHLEGPFL APARRGAQPG EHLRGPDRGV FAELVAAGAG
TLRVITLAPE LPGAGAVTEA ALAAGVIAAA GHTDATYDEA ASGFAAGMTL ATHLFNGMRP
LHHREPGPAG AALDAGVACE LINDGVHVHP ALLRLVAAEP ARLVLVTDAV DAAGVGDGDY
LLGGHPVRVR DGQARLAATG ALAGSTLTMD LAVRRAVAAG LALEVAVAAA ATNPARVLGL
AHRCGSIAPG LDADLVVLDA DLRVTRVMAA GRWVPGPATR PIAAG