Gene Gdia_1004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1004 
Symbol 
ID6974401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1131388 
End bp1132548 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content66% 
IMG OID643390526 
Productconjugation TrbI family protein 
Protein accessionYP_002275402 
Protein GI209543173 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2948] Type IV secretory pathway, VirB10 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGGA GCACCGGTGC GGGGGAGGCG GACAAACCGG CGCCTCCGGT CAGGCCGCCA 
GGCGAGATGC GGTTGCGCGT CACGCGACCG CCGGTCACGC GACTGTCGCG CAGGGTGATC
ATGGGGCTGA CGCTGGTGGC GGTGCTCGGT GTCGGGGGCG CGCTCTATGT GGCGCTGAAG
CCCACACCCC GAACGGGCAA GGCCGAACTC TATAATACCG GCAACCGCAC CACACCGGAT
GGACTGGCGA ACCTGCCGGG AGACTATACC CATCCGCCAC GTCTTGGGCC GCCGTTGCCC
GGCGATCTCG GCAGGCCGTT CCTCAAGGCA GGCGTGGGTG CTCCCGGCAT GGCGACACCC
GCCGCACCGG CTGATCCCGA GAAGCAGCGC ATGGCTCAGG AGCAGGAAGC AGCACGGGTC
AGCCACCTGT TCACCCAGAC GCAGGTCGGG CGAAACAGCG CCGCCCCGGT GACGGCCGGC
GCCACCCCGG TTCCGGCAGC CGGTGCTGCC ACCGATCGCA ACCAGGCGTT TCTCGATGGA
CCGGTTGACC AGCGAACCGT CAGCGCCGCG CGTCTGACGA CCGTTCCCAG CCCCTATGTA
ATCCAGGCAG GATCAATCAT TCCCGGTGCG CTGATCACCG GCATCCGCTC TGACCTGCCG
GGTGAGGTGA CGGCCCAGGT GACGGAGAAT GTCTACGACA GCCCGACCGG GCATATCCTG
CTGGTCCCGC AGGGGGCGAA ACTGATCGGC CGCTATGATT CCCAGATCGC CTACGGGCAG
ACGCGAGTGC TGCTGGTCTG GACCCGACTG ATACGCGGTG CGAATGATTC GATTGTGCTG
GAGAACGAGC CGGCTGCCGA TGCGGCAGGG TTTGCCGGGT TGCAGGACCA GACCGACAAT
CATTGGGGCG AGGTGTTCAA GGCCGCGTTG GTCTCGACCG TGCTCAGCGT AGGTTCAGAG
GCCGATATCG GTGGCAGCAA TGGCATCGCC CAGGCGCTGC GCACCGGCGG CTCGCAAGGA
TTTAACCAGA TCGGTGAGCA GATGGTCGGA CGCTCGCTCA ATATCCAGCC CACCAATACG
ATCCGGCGCG GCTTTCCGGT GCGGGTCATG GTGCATCGTG ATCTGGTTCT CTCCCCTTAC
AGACAGGAGG CTTCCCGATG A
 
Protein sequence
MSGSTGAGEA DKPAPPVRPP GEMRLRVTRP PVTRLSRRVI MGLTLVAVLG VGGALYVALK 
PTPRTGKAEL YNTGNRTTPD GLANLPGDYT HPPRLGPPLP GDLGRPFLKA GVGAPGMATP
AAPADPEKQR MAQEQEAARV SHLFTQTQVG RNSAAPVTAG ATPVPAAGAA TDRNQAFLDG
PVDQRTVSAA RLTTVPSPYV IQAGSIIPGA LITGIRSDLP GEVTAQVTEN VYDSPTGHIL
LVPQGAKLIG RYDSQIAYGQ TRVLLVWTRL IRGANDSIVL ENEPAADAAG FAGLQDQTDN
HWGEVFKAAL VSTVLSVGSE ADIGGSNGIA QALRTGGSQG FNQIGEQMVG RSLNIQPTNT
IRRGFPVRVM VHRDLVLSPY RQEASR