Gene Gdia_3520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3520 
Symbol 
ID6976972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3850860 
End bp3852668 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content71% 
IMG OID643393039 
ProductABC transporter related 
Protein accessionYP_002277858 
Protein GI209545629 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCCTC CCCTTCTTCT CCTCCAGGAC ATCACGCTCA CGCTCGGCGG CAAGCCGCTG 
CTGGACGGGG CCGGGTTCGG CGTATCCGCC GGCGAGCGGA TCTGCCTGGT CGGCCGCAAC
GGCAGCGGAA AGTCCACCCT GCTGCGCATC GCGTCCGGCG ACCTGCTGCC CGATGACGGC
ACCTGCTTCC TCCAGCCCGG CACCTCGGTC CGCACCCTGC CGCAGGAGCC CGACCTGTCG
GGCTTCGCCA CCACGCTGGA TTACGTCCGC GCGGGCATGG GCCCCGGCGA CCCGGAACAC
CGCGCCGCCC TGCTGCTGGG CGAACTGGGC CTGACGGGGG CCGAGGACCC GGCCAGCCTG
TCGGGGGGCG AGGCCAGGCG CTGCGCGCTG GCCCGGGCCC TGGCGCCCGA GCCCGACCTG
CTGCTGCTGG ACGAGCCGAC CAACCATCTG GACATGCCCA CCATCGAATG GCTGGAACGC
GAATTGCTGT CGCTGTCCTC GGCCATGGTG ATCATCAGCC ATGACCGGCG GCTGCTGGAA
ACCCTGTCGC GCAGCGTCGT CTGGCTGGAT CGCGGCACGA CCCGCCGCCT GGACCAGGGT
TTCGCCCGCT TCGAATCCTG GCGCGAGGAG GTTCTGGAGC AGGAAACCCG CGACGCCCAC
AAGCTGGACC GCCAGATCGC GCGCGAAGAG GACTGGATGC GCTACGGCGT CACCGCCCGG
CGCAAGCGCA ACGTCCGGCG GGTGGGCGAA CTGGCCGCCC TGCGGCAGGC CCGGCGCGAG
GCCGTGCGCA CGCCCGGCGG CCTGAAGATG CAGGCCAGCG AGGACGGGCT GTCCGGCAAG
CTGGTCGCCG TGGCCGAACA GGTGCGCAAG GCCTACGGGT CCCGCGTGAT CGTCGACGGG
CTGGACCTGC GGCTGCTGCG GGGCGACCGT CTGGGCATCG TCGGCGCCAA CGGCGCGGGC
AAGAGTACGC TGCTGCGCCT GCTGACCGGC CAGGACCAGC CCGATTCCGG CGAGATCCGG
GTCGGCACCG CCCTGTCCAC GGTCACGCTC GACCAGCAGC GCCGGGCGCT GGACCCCGCC
CGCACGCTGG CCGATACGCT GACCGGCGGC GGGGGCGACA TGGTCCAGGT CGGGACCGAA
AAACGCCACG TCGTCGGCTA CATGAAGGAT TTCCTGTTCC GTCCCGAACA GGCGCGCACC
CCCGTCGGCA TGCTGTCGGG CGGCGAGCGC GGGCGGCTGA TGCTGGCCTG CGCGCTGGCG
CGTCCCTCCA ACCTGCTGGT CCTGGACGAA CCGACCAACG ACCTGGACCT GGAGACCCTG
GACCTGCTGC AGGAGATGCT GGACGATTAT TCCGGCACCG TCCTGCTGGT CAGCCATGAC
CGCGACTTCC TGGACCGCGT CGCGACGTCG GTGCTGGTGG CCGAGGGCGA CGGGCGATGG
GTTGAATATG CGGGCGGATA TAGCGACATG CTGGCCCAGC GCGGCGGCGT GGCCCCGGCC
GGCCGGGGCC GGCGCCCCGG CACGGAAACG CAGGCCCCCG CCCAGAAGCG TCCGGACAAG
GGACCGGCAC GCAAGCTGTC CTACAAGGAC CAGCATGCGC TGGACCGCCT GCCGGGCCAG
ATCGCGGCGC TGGAGGATGA AATCGGCCGC TTGCGCACCG TTCTGTCGGA TGGTGCGTTG
TATGCCCGCG ATCCGGCGGC GTTCACGGCG GCGACCGGGC AACTGGAACG GGCCGAGGCC
GAACTGACCG CGGCCGAGGA ACGGTGGCTG GAACTGGAAA TGCTGCGCGA AACGCTGGGT
TCGTCCTGA
 
Protein sequence
MAPPLLLLQD ITLTLGGKPL LDGAGFGVSA GERICLVGRN GSGKSTLLRI ASGDLLPDDG 
TCFLQPGTSV RTLPQEPDLS GFATTLDYVR AGMGPGDPEH RAALLLGELG LTGAEDPASL
SGGEARRCAL ARALAPEPDL LLLDEPTNHL DMPTIEWLER ELLSLSSAMV IISHDRRLLE
TLSRSVVWLD RGTTRRLDQG FARFESWREE VLEQETRDAH KLDRQIAREE DWMRYGVTAR
RKRNVRRVGE LAALRQARRE AVRTPGGLKM QASEDGLSGK LVAVAEQVRK AYGSRVIVDG
LDLRLLRGDR LGIVGANGAG KSTLLRLLTG QDQPDSGEIR VGTALSTVTL DQQRRALDPA
RTLADTLTGG GGDMVQVGTE KRHVVGYMKD FLFRPEQART PVGMLSGGER GRLMLACALA
RPSNLLVLDE PTNDLDLETL DLLQEMLDDY SGTVLLVSHD RDFLDRVATS VLVAEGDGRW
VEYAGGYSDM LAQRGGVAPA GRGRRPGTET QAPAQKRPDK GPARKLSYKD QHALDRLPGQ
IAALEDEIGR LRTVLSDGAL YARDPAAFTA ATGQLERAEA ELTAAEERWL ELEMLRETLG
SS