Gene Gdia_2885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2885 
Symbol 
ID6976317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3159570 
End bp3161351 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content77% 
IMG OID643392393 
ProductSel1 domain protein repeat-containing protein 
Protein accessionYP_002277231 
Protein GI209545002 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.048224 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGTCT TGTCTGCTTT GGGGGCGCGG TTGTCCCTGC GGGGGCGGCT GGCGCATGGC 
GTGCGGCTGA TCGAGGCGGG CGACGCGGTG GGGGGTGTCG CCCAACTGGG CATGGTGGCC
GAGGCCGGGG TGGCGGAGGC CCAGTTTCGT GTCGGGCGGG CCTATCTGGA GGGGACCGGC
GTGCCGCGCA GCCTGGTCGA GGGCGCGCGC TGGATGCGCC GCGCGGCCGA GGCCGGGTGG
GTCGAGGCCC GGCTGGTGCT GGGGACGCTG TATCTGTTCG GCCTGCCGCG CGACGCGGCA
TCGGACCATG GCGTGCGGCT GGCGGCCGAG GCGCCGGGCG ATCGCGTGCC CGACCCGGAC
GAGGCCGCGT ACTGGACGCA CCTGGCGGCC GAGGCGGGCT CCCCCGACGC GCAGGCGCTG
TACGGCTACA TCCTGGCCGA TGGCCCCGTC GGCCTGCGCG ACCGGGTGGC GGCCGAGGGC
TGGTTCGCCC GCGCCGCCGC CGCCGGCTGC GCGCAGGGTC ATCTGGGGCT GGGCCTGACC
TGGATGCGCG CGGCGCTGCG GCACTCGCCG GAGGCCGCGT CCTCGGGCGA GGCCGACCGT
GCGCGCGCCG CCGGCGCGCT GCGCGTGGCC GCCGAGGCCG GGCTGGGCTC GGCGCAGTTC
CTGCTGGGGG CGCTGACCGA GCAGGGCAGC GGCGTCGCCC GCGACGTGGC GGCGGCCACC
GCGCTGTATG CCAGCGCCGC CCGGGCCGGG GTGTGCGCGG CGCAGGCGCG CTACGGCCTG
GCGCTGCTGG AGGGCAGGGG CTGCGCGCGC GACGCCGTGC AGGGCGAAAG CTGGCTGCGC
CGCGCGGCCC TGGCCGGCGA CCGCGAGGCC GCGTCGCTGC TGGGCGACCT GTATGCAAGG
GGCGGCGACC TGCCGCCGAA CGACGTGGAA GCCGCGTCGT GGTACCACCG CGCGGCGGCG
CTGGGCCATG CGGCGGCATG CCGGGCGCTG GGGCTGCTGC ACCTGGCCGG GGGCGGCCTG
CCGCGCGACG CGGCCGAGGC CGCGCGCTGC TTCCGGCAGG CCATGGCGCT GGGCGACGCC
CGGGCGGGTG CCGATCTGTG CAACCTGCTG CTGGCCGACC CGACGCTGAG CGCCGCGATG
TCGGACGACG AGCGGCGCGA GTTAGGCCGT TCCTTCGCCC GCGCGGCGCA GGAGGGGGAT
GCGGTCGCGG CGTTCAATTT CAGCATCTGC CTGGCGCAGG GGCTGGGGGT GGAGCGCGAC
GAAAACGCCG CCGCGTCGTG GATGAAGCGG GCGGCCGACG GAGTGGTGAA CGCCCGGTAC
TGGTATGGCC GCATGCTGCT GGAGGGCCGG GGGCACGCGC CCGACCCGGT GGCCGGCCGT
GCCTGGATCG CGCGGGCCGC CGAGGCCGGC ATGGCCGAGG CGCAGGTGGC GCTGGCGCAA
TTGCTGCTGA CCGGCAACGG CGGCGCCCGC GATCACGTCG GGGCGGCGGA CTGGTATCGC
CGGGCGGCGG AGCAGGGGCA GGTGGATGCC ATGTTCTCGC TGGGCGCGCT GCTGGGCGGG
GGGCATGACA TCCCGATGGA CCGTGTCCAG GCCCAGCACT GGTTCCGCAT GGCCGCCGAA
CGCGGCAACG CGCTGGGGCA GTTGATGCTG GGGCGCTATC TGCTGCGCGG GCTGGCCGGC
ACGATGGACG GCGCGCAGGC CCGGCACTGG CTGGACCGCG CCCGGGCGCA GGGCGTGGCG
GACGCGGCGG CCGAACTGGC GCACATGGAT GGGGCGGGCT GA
 
Protein sequence
MRVLSALGAR LSLRGRLAHG VRLIEAGDAV GGVAQLGMVA EAGVAEAQFR VGRAYLEGTG 
VPRSLVEGAR WMRRAAEAGW VEARLVLGTL YLFGLPRDAA SDHGVRLAAE APGDRVPDPD
EAAYWTHLAA EAGSPDAQAL YGYILADGPV GLRDRVAAEG WFARAAAAGC AQGHLGLGLT
WMRAALRHSP EAASSGEADR ARAAGALRVA AEAGLGSAQF LLGALTEQGS GVARDVAAAT
ALYASAARAG VCAAQARYGL ALLEGRGCAR DAVQGESWLR RAALAGDREA ASLLGDLYAR
GGDLPPNDVE AASWYHRAAA LGHAAACRAL GLLHLAGGGL PRDAAEAARC FRQAMALGDA
RAGADLCNLL LADPTLSAAM SDDERRELGR SFARAAQEGD AVAAFNFSIC LAQGLGVERD
ENAAASWMKR AADGVVNARY WYGRMLLEGR GHAPDPVAGR AWIARAAEAG MAEAQVALAQ
LLLTGNGGAR DHVGAADWYR RAAEQGQVDA MFSLGALLGG GHDIPMDRVQ AQHWFRMAAE
RGNALGQLML GRYLLRGLAG TMDGAQARHW LDRARAQGVA DAAAELAHMD GAG