Gene Gdia_1887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1887 
Symbol 
ID6975310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2103384 
End bp2104343 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content66% 
IMG OID643391413 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_002276262 
Protein GI209544033 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.50224 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0288593 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGAT TGCTGCGCGT GTTGACAGGT CTGGCGGTCC TGTCCCTGTC CGTCCTGTCT 
CCAGCCCGTG CCCAGGTGCC GGACACGGTG ACGGTCATTC TCGACTGGTT CCTGAATGCC
GATCACGCGG CGCTTCTGGC GGCCGACTAT AGCGGCGCGT TCCGCCGGCA TGGATTGCAG
GTGCACCTGA TCGCCCCGTC CGATCCCGGG TCCCCCGCGC GGCTGGTGGC GGCGGGGCAG
GCGGATCTGG CGGTGTCCTA CGAAACGCAG CTGGGCATGC TGGCCGAGCA GGGGATTCCG
CTGGTGCGGG TGGGCACGCT GATCGACACG CCGCTGGATA CGCTGATCAC CGGGCCGGAC
ATTCATTCCC TGAAGGACCT GAAGGGCAGG ACGATCGGGA TTTCCATGGC GGGGGTCGAC
GACGCGGTGC TGGCGGCCAT GCTGGGGTCG GTCGGGCTGT CCCTGTCCGA CGTGCATCAG
GTCAACGTCA ATTTCCAGTT GGAACAGGCC CTGATGTCAC ACGCGGTCGA TGCCGTGATC
GGGGCGACGC GCACCTATGA ACTGATCGAC CTGCGGCAGA AGGGATTCGC CCCCGGCGCG
GTCTATCCCG AGGAACATGG CGTGCCGCTG AATGACGAAC TGATCTTCCT GGCCGCGCGC
GACCATGCCC ATGACCCCAG GATCGTCCGC TTCATGGACG CGCTGGAGGA GGGGACGAAC
GTCCTGCTGA ACCATCCGGA CGATATCCTG GCCCAGGCCG TCCGGGAGCA TCCCGAACTG
GATACGAAGC TGAACCGTGC CGCCTGGACG GCGACCCTGT CGCGCGTCTG CAAGCAGCCC
TCGGTCCTGA ATGCGCGGCG CTATCGGGCG TTCATGGCAT TCCTGCGTGC CCGCGGCGTG
GTGCATCGGG ACATGAACCT GTCCGACTAC GCCGTCGATC CGGCTGACGG CACGCCGTAG
 
Protein sequence
MTGLLRVLTG LAVLSLSVLS PARAQVPDTV TVILDWFLNA DHAALLAADY SGAFRRHGLQ 
VHLIAPSDPG SPARLVAAGQ ADLAVSYETQ LGMLAEQGIP LVRVGTLIDT PLDTLITGPD
IHSLKDLKGR TIGISMAGVD DAVLAAMLGS VGLSLSDVHQ VNVNFQLEQA LMSHAVDAVI
GATRTYELID LRQKGFAPGA VYPEEHGVPL NDELIFLAAR DHAHDPRIVR FMDALEEGTN
VLLNHPDDIL AQAVREHPEL DTKLNRAAWT ATLSRVCKQP SVLNARRYRA FMAFLRARGV
VHRDMNLSDY AVDPADGTP