Gene Gdia_3002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3002 
Symbol 
ID6976436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3279871 
End bp3281421 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content67% 
IMG OID643392510 
ProductRND efflux system, outer membrane lipoprotein, NodT family 
Protein accessionYP_002277347 
Protein GI209545118 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID[TIGR01845] efflux transporter, outer membrane factor (OMF) lipoprotein, NodT family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCGC CATTTTCCCT GATGTCCCGC CGGGGCGCCC TGCATCTGGG CGGCATTCTG 
GCACCGCTGG CGCTGGCCGG CTGCATGGTC GGGCCATCCT ACCACCGCCC CGCCGCCCCG
GTTTCGGCCC GCTTCAAGGA ACTGACCCCG GCCGCAGGCT GGGAACGCGC CCGGCCCGCC
ATGGCCGAAC TGCCCAAGGG CGCCTGGTGG ACGATCTATA ACGATCCCGT CCTGAACCGA
CTGGAATCGC AGGTCGCGAT CTCGAACCAG AACGTGCGGA TGTACGAGGC CAACTACCGC
CAGGCCCGCG CCATGATCGA CAGCGTGCGG GCACAGCTTT TTCCCACCCT CAGCGGCAGC
CTGGGGTTCA ACCGCAACAG CCAGGGGCGT GGCTCGCGCT CGGCGTCCAC CGGCAGCCTG
GTCAGCTACG GGGGGGCGGC GACCAACACG ACCGAGACGA CCTATTCGAT GGGCCCCAGC
GCAAGCTGGG ACCTGGATCT GTGGGGCCGC ATCCGCCGCA ATATCCAAAG CCAGGTGACC
GAGGCCCAGG CCAGCGCCGC CGACCTGGCG AATGCCACCC TGTCCTACCA GGCGCAGCTC
GCCACGGCCT ATTTCAACCT GCGCTACCAG GATTCGCTGC AGGAGCTGCT GCGCCGCTAC
GTGGATTTCA ACACCCAGGC GCTGCAGATC ACGCAGAACC AGTACGAGGC CGGCACCGCC
GATCCGACGG CGGTGATGCA GGCCCGCACC CAGCTTGAAC AGAATCGTGC CACGCTGATC
CAGGCCGGCA TCGCCCGCGC GCAGTACGAA CACGCCATCG CGGTGCTGAT GGGCCGTCCG
CCCGCCGACC TGACGATCAC GCCCGGCACG CTGCCCCGCC AGATCCCCGC CATTCCGGTC
GGGGTCCCCG CCGACCTGCT GCAGCGCCGC CCCGACGTGG CCGCCGCCGA ACGGGCCATG
GAACAGTACA ACGCCCAGAT CGGCGCCGAC ATGGCGGCGT TCTTCCCCGA CGTCACGCTG
ACGGCCGACT ATTCCTACAG CGGCGACCCC ATCGGGCAAC TGGTCCAGGT GGTCAACCGG
ATCTGGTCGC TCGGCGCCTC GGCCTCGGAA GTCCTGTTCC AGGGCGGGTC CCGCATGGCG
GCGGTGCACG GCGCCAACGC CCAGTACGAC AGCGCCGTCG CGACCTACCG CCAGACCGTG
CTGACGGCCC TGCAGAATAC CGAGGACAAT CTGTCCAACC TGCGCATCCT TGAACAGCAG
GCAACCCAGC AGCAGATCGC CCTGGATTTC GCCAACCAGG CCGTCCAGGT CTCACTGAAC
CAGTACGAGG CCGGAACCCA GATCTACACC ACGGTCATCA CCAACGAGAC CACGGCCCTC
AGCAATGCCG AATCCGCCCT GTCCATCCAG CAGCAGCGCA TGGTCGATTC CGTGGCGCTG
GTGCAGGCGC TGGGCGGCGG GTGGAACGCC ACCAGCCTGC CGTCGAAGGC TTCGATGCAG
ACCGACAATC CGTTCCTGCC CTCCTTTATC CAGAAGGACA AAAACCAGTA G
 
Protein sequence
MTSPFSLMSR RGALHLGGIL APLALAGCMV GPSYHRPAAP VSARFKELTP AAGWERARPA 
MAELPKGAWW TIYNDPVLNR LESQVAISNQ NVRMYEANYR QARAMIDSVR AQLFPTLSGS
LGFNRNSQGR GSRSASTGSL VSYGGAATNT TETTYSMGPS ASWDLDLWGR IRRNIQSQVT
EAQASAADLA NATLSYQAQL ATAYFNLRYQ DSLQELLRRY VDFNTQALQI TQNQYEAGTA
DPTAVMQART QLEQNRATLI QAGIARAQYE HAIAVLMGRP PADLTITPGT LPRQIPAIPV
GVPADLLQRR PDVAAAERAM EQYNAQIGAD MAAFFPDVTL TADYSYSGDP IGQLVQVVNR
IWSLGASASE VLFQGGSRMA AVHGANAQYD SAVATYRQTV LTALQNTEDN LSNLRILEQQ
ATQQQIALDF ANQAVQVSLN QYEAGTQIYT TVITNETTAL SNAESALSIQ QQRMVDSVAL
VQALGGGWNA TSLPSKASMQ TDNPFLPSFI QKDKNQ