Gene Franean1_5989 details

Gene Information

Locus tag: Franean1_5989
Symbol: guaA
ID: 5674310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7300224
End bp: 7301828
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 71%
IMG OID: 641244837
Product: GMP synthase
Protein accession: YP_001510239
Protein GI: 158317731
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0518] GMP synthase - Glutamine amidotransferase domain
        [COG0519] GMP synthase, PP-ATPase domain/subunit
TIGRFAM ID: [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit
            [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit
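
The coordinates and lengths listed above are consistent with each other. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the page states neither convention; the function names are illustrative only).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(7300224, 7301828)
print(length_bp, "bp")                  # 1605 bp
print(protein_length(length_bp), "aa")  # 534 aa
```

Both results match the Gene Length and Protein Length fields in the record.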


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.319605
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0715353
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAACGAGT CACCTCGCGC CAGCTATGGC TCCACCGGCG CACATCCCGG TTACGATCCG 
GTTCTCGTGA TCGACTTCGG CGCCCAGTAC GCCCAGTTGA TCGCTCGGCG CGTTCGCGAG
TGCCATGTGT ACTCGGAGAT CGTCCCGTGG GATATGCCGG TAGCCGAGAT CCTGGCCCGG
CGCCCGGCCG CGCTCATCCT CTCTGGCGGG CCGAAGTCCG TCTACTCCCC GGGTGCCCCC
CGGGTCGACC CCGCGCTGTT CGCTGCCGGG GTGCCCGTCC TGGGCATCTG CTACGGCCAC
CAGGTGATGG CGCAGGCGCT GGACGGCACG GTCGCCCGCA CCGGCACCGC CGAGTACGGA
GCCACCCGGC TGCGGGTGGA CGACCCCGGG GTGCTCTTCG ACGGCCTCCC CACCAGCCAG
CAGGTCTGGA TGTCGCACGG CGACTCGGTG ACCGCCGCGC CCCCCGGTTT CCGGGTGACC
GCGTCGACGC CGTCCACCCC GGTCGCGGCG TTCGAGGACC CCACCCGCAG GCTCTACGGC
GTGCAGTTCC ACCCCGAGGT CGTGCACAGC GAGCGCGGGA TGGATGTCCT GCGGCACTTC
CTGCTGGTCG GCGCGGGCTG CCGGCCGTCC TGGACGATGA TCAATATCGT CGAGGAGGCC
GTGACCGCGG TGCGGGCGCA GGTCGGCAAT GGCCGGCTGA TCTGCGGCCT GTCCGGCGGG
GTCGATTCGG CGGTGGCCGC CGCGCTGGTA CAGCGAGCCG TCGGTGACAC CCTGACCTGC
GTGTTCGTCG ACCACGGGCT GTTGCGGGCG GGCGAGGCGG AACAGGTCGA ACGCGACTTC
GTCGCCTCCA CCGGCGTCGA CCTCGTCCAC GTGAAGGCGG CGGACCGCTT CGCCTCCGCG
CTGGCCGGGG TGACGGATCC GGAGCAGAAG CGGAAGATCA TCGGGCGGGA GTTCATCCGC
GTCTTCGAGG AGTCCGCGCG TGAGCTCGAC GCCCGCGCGG AGGCGGAGGG AACGCACATC
GGTTTCCTCG TCCAGGGCAC TCTCTACCCG GACGTCATCG AGTCCGGCTC GCCCACCGCC
GCCAAGATCA AGTCCCATCA CAATGTCGGC GGGCTGCCGG ACGACCTGCA GTTCGACCTG
GTCGAGCCGC TGCGCACCCT GTTCAAGGAC GAGGTCCGCC GGCTCGGTGA GGAGCTCGGC
CTGCCCGAGG ACATCGTCTG GCGCCAGCCG TTCCCGGGCC CGGGGCTGGC CGTCCGCATC
ATCGGTGAGG TCACCCCCGA GCGGCTCGAG ATCGTCCGGG CCGCCGACGC GGTCGTCCGG
GACGAGATCC GCCGAGCGGG GCTCGACCGG GAGATCTGGC AGGTCTTCGC CGTGCTGCTG
GCGGACGTCC GGTCGGTCGG CGTCCAGGGC GACGAGCGGA CCTACGGTTT CCCGGTGGTG
CTGCGCGCGG TGACCAGCGA GGACGCGATG ACCGCCGACT GGGCTCGGCT GCCCTACGAC
CTGCTGGAAC GCATCAGCAA CCGGGTCGTC AACGAGGTTG GGCAGGTCAA CCGGGTCGTC
TACGACATCA CCTCGAAGCC GCCGGGCACG ATCGAGTGGG AGTAG
 
Protein sequence
MNESPRASYG STGAHPGYDP VLVIDFGAQY AQLIARRVRE CHVYSEIVPW DMPVAEILAR 
RPAALILSGG PKSVYSPGAP RVDPALFAAG VPVLGICYGH QVMAQALDGT VARTGTAEYG
ATRLRVDDPG VLFDGLPTSQ QVWMSHGDSV TAAPPGFRVT ASTPSTPVAA FEDPTRRLYG
VQFHPEVVHS ERGMDVLRHF LLVGAGCRPS WTMINIVEEA VTAVRAQVGN GRLICGLSGG
VDSAVAAALV QRAVGDTLTC VFVDHGLLRA GEAEQVERDF VASTGVDLVH VKAADRFASA
LAGVTDPEQK RKIIGREFIR VFEESARELD ARAEAEGTHI GFLVQGTLYP DVIESGSPTA
AKIKSHHNVG GLPDDLQFDL VEPLRTLFKD EVRRLGEELG LPEDIVWRQP FPGPGLAVRI
IGEVTPERLE IVRAADAVVR DEIRRAGLDR EIWQVFAVLL ADVRSVGVQG DERTYGFPVV
LRAVTSEDAM TADWARLPYD LLERISNRVV NEVGQVNRVV YDITSKPPGT IEWE
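
The gene sequence above is laid out in blocks of 10 bases separated by spaces, so whitespace has to be stripped before any base counting. The following is a minimal sketch of how a GC fraction such as the one reported under Gene Information could be recomputed from the blocked text. The function name is hypothetical, and the method the database uses to derive its GC content value is not stated on the page.

```python
def gc_content(blocked_sequence: str) -> float:
    """Return the G+C fraction of a DNA sequence, ignoring spaces and line breaks."""
    # Remove all whitespace so the 10-base blocks join into one string.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Usage: paste the full gene sequence in place of this short excerpt.
excerpt = "GTGAACGAGT CACCTCGCGC"
print(f"{gc_content(excerpt):.0%}")
```

Applied to the whole gene sequence, the result can be compared with the GC content field of the record.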