Gene Gdia_3565 details


Gene Information

Locus tag: Gdia_3565
Symbol:
ID: 6973371
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011367
Strand:
Start bp: 11661
End bp: 12917
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 50%
IMG OID: 643393534
Product: Sel1 domain protein repeat-containing protein
Protein accession: YP_002278352
Protein GI: 209542171
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
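
The reported gene and protein lengths can be cross-checked against the coordinates. The sketch below assumes 1-based inclusive coordinates and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(11661, 12917)
print(length_bp)                  # 1257
print(protein_length(length_bp))  # 418
```

Both values agree with the Gene Length and Protein Length fields in the record.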


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGCATT TATCCTTTCA GTATCTTGAG GGTTTTCCCA GCTTTGAAGG TGCTTCGCAT 
TTCGCACACC TAGTCAAAGA TGAGCGCAAA TGTGCGAATA TCTTTGACGT CAAAGCGAGC
GAATATTGGG CCAAGAAGGC AATTGAAGCG GGTGCCGCAG AAGGCCATGT CTGTTTGGGT
TATATCTATT CGTCAGTCGA TCCCGATCGA CGGAATATTG ACCTGGCAAT CGAGCATTAT
GAGACTGCAA TGCAGGCGGG AAATCGTAAA GCTGGATTGG GGCTGGGGCA AATACTTCTA
CAGTATCGGA GCGCCGAGGC TGAGAGCATA AGAAAAGCCG AAAAATCTTT GCTTTTTGCT
ATTGAGGAAA AAAATCCTAC TGCGGCATAT CTTCTGGGCG GATTATATGA AGTCGGATCG
GGCGTGGAGA AAGATCTGTC CAGAAGCAGA CAGTTCTATC AAATCGCTGC GGAAGGTGGC
GTGGTCAAAG CCATGGTTCG TCTCAGCCAG CTCCTATTCG ATGGAGAGGG CGGATCTCCC
GATCAAATAA AAGGGGAGAC GTGGTTGCGG CAGGCCGCCA TTAAAGGAGA TGGCGCGGCT
TGCCGCTTGC TGGCCGGTAT CTATTCCACA AGAGCGCGTG ACGATGAGGC GCGGCGCTGG
TACGAGACTG GAGCGAAGCT CGGCGATCCG ATCGCGGCAT TCAGGCTGGC CGAAGCAATA
GAGAAGCTTC AAGGTTCCGG TGCAATGCCT GTTGAAGCAA TGGAGTGGTA TATTCGGGCG
CTGAAAAGTG GATATGAACC AGCCGCTATT AAATTATGCA CAATGTTGTT CAATAAAGAG
ATGAGGGAAT TTGGAAATAA CATCCTGATG AGAATAGAGG AATTGTCCAA TCAGGGAAAT
CCACTGGCCC ATCTTGTTCT GGCTGCGCAT ATCAGGTCTT CACCTAACGG TGATATGCAT
CATGCCAGAT GTTTATTGAT CAACGCATCC GAAGCGGGCG TGGTGACGGC GCATTCTGCT
GCTGGTGAGA TGATTATGAA AGGTCTCGGA GGTAGGGCAG ACATTGGGCT TGCTATCGGG
TTTTTCCGCA AGGCTGCCGA CGCCGGGCAT CTCGATGCAA TGTATGCATT GGCGACGCTC
TATAAAAATC GCCGGTCTTT GTATTACAAT CATACTAAAG CTGAGTTCTG GACTCGCCGA
GCTGCGGCAG GCGGTCACGT AGCTGCCAAT AATATGCTTA AAGAATTCAA TATTTGA
 
Protein sequence
MVHLSFQYLE GFPSFEGASH FAHLVKDERK CANIFDVKAS EYWAKKAIEA GAAEGHVCLG 
YIYSSVDPDR RNIDLAIEHY ETAMQAGNRK AGLGLGQILL QYRSAEAESI RKAEKSLLFA
IEEKNPTAAY LLGGLYEVGS GVEKDLSRSR QFYQIAAEGG VVKAMVRLSQ LLFDGEGGSP
DQIKGETWLR QAAIKGDGAA CRLLAGIYST RARDDEARRW YETGAKLGDP IAAFRLAEAI
EKLQGSGAMP VEAMEWYIRA LKSGYEPAAI KLCTMLFNKE MREFGNNILM RIEELSNQGN
PLAHLVLAAH IRSSPNGDMH HARCLLINAS EAGVVTAHSA AGEMIMKGLG GRADIGLAIG
FFRKAADAGH LDAMYALATL YKNRRSLYYN HTKAEFWTRR AAAGGHVAAN NMLKEFNI
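
The protein sequence can be checked against the gene sequence by translating the DNA codon by codon. The sketch below is a minimal illustration, not the database's own method. It assumes the space-separated 10-base blocks shown above are purely cosmetic, and it uses the standard codon assignments. Translation table 11 shares these assignments and differs only in which codons are permitted as starts, which does not matter here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_content`) are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order. Translation table 11 uses the
# same assignments and differs only in its set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGTGCATT TATCCTTTCA GTATCTTGAG"))  # MVHLSFQYLE
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the listed protein sequence and the GC content field.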