Gene Gdia_1438 details

Gene Information

Locus tag: Gdia_1438
Symbol:
ID: 6974847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1602540
End bp: 1604093
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 57%
IMG OID: 643390969
Product: Sel1 domain protein repeat-containing protein
Protein accession: YP_002275833
Protein GI: 209543604
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
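
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the usual convention for GenBank-style records, but not stated in this record) and that the CDS ends in a single stop codon. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus the terminal stop codon (assumes an unspliced CDS)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1602540, 1604093)
print(length, protein_length_aa(length))  # prints: 1554 517
```

Both values match the Gene Length and Protein Length fields listed in the record.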


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.262723
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.636598
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTTCTT TTCGAAACAT TTTCGACAGA AACACGTACT TGCCGGGAAA CCGATTGTAC 
GGAAAAGGAA AGCGCTACCT GGCATCGGAT CAAATTGCGG ATTGGGTAAA AGCCGCGAAG
TGCTTCTTAG GAGCTGCGAA CAGAGGCAGA CCGGAAGCCC AGTATCAGCT TGGACTCTGC
TATCTCGAAG GCAAAGGCGT GCCGCGCAAC CGAGCACAGG CTTTATGCAG GCTGACTGCA
GCGGCCGATG CCGGAATTGC CGAGGCTATG GCCAAGCTTT CCTTTCTGTA TCTCGAAGGG
CTTCCCGACT TCGAGGATGC TTCGCATGTT GCTCGACTGG TTAAAGATGA GCGGCATGGC
AGAAACAATG TTGACACTGA CAAGGCTCTT TATTGGGCAA ATAAGGCGAT CGCGGGCGGC
GTGGCAGAAG GGCACGTCTG CCTTGGTTAT ATCTATTCGT TAGAGAGGTC AGCCTATAAG
GATATCGATA TGGCCATAAA GCATTATAGG GCTGCGATGG AGGCGGGAAG CCGCAAGGCA
GGACTTGGTC TTGGACAAAC GCTCCTACGG TTTCGGAGTG CGGGCCTTCC TGCTCTCAAG
GAGGCGGAGA AGGCGCTTCT TTTTGCCATG GAGGAGAAAA GTCCCGTTGC CGCATATCTT
CTGGGTGGAT TGTACGAAGT CGGATCCGGA GTGGAAAAAG ACCTGTCCAG AAGCCGGGAA
TTGTACCGGC AGGCCGCCGA AGGCGGCGTG ATCAAGGCCA TGATGCGCCT GAGTACTTTT
CTGCTCGACG GCCATGGTGG TCCCCCCAAT CGAACAGCAG GGGAAACCTG GTTGAGACGC
GCAGCACTCA AGGGAGACTG GAGCGCCTGC CGTCTTCTGG GCGAAATGTA TGCCGCCAGA
CACCATGCAT ATGAAATGAG GCGATGGTAT GAAATAGGAG CCGGACATGG CGATCCGGTT
TCGATCTTCA GGCTGGGAGA GGCCATCGAG CGTCTTCAGG AGCCGGGCGG AACGTCCATC
GAGGCCGTAG AGTGTTATCT CCTGGCTCTG CAACATGGAT ATGAACCGGC CGTGCGCAAG
CTTGCCGCGA GCGTACGGAA TCCGGGCGAA TTCCAGGCTC GCGTCCTCGC GAAGCTCAAC
GCCCTATGCG CCCAGGATAT TCCGTTGGCG TATCTTGCCC TGGCGCTGTG CATAAGATCC
TTGAAGGGGG GTGACATGGT CATGGCCAGA GAACTGTTCA TCAAGGCTTC TAAGGGGGGA
ATCGTGACGG CTCACGTCGC TGCGGGCCAA ATGATCATGA ACGGTCTTGG AGGCAAGGCG
GACGCCGGCC TCGCTCGGGA GTTTTTTTTC AAAGGCGCAA AGGCCGGTCA CATCGGCGCC
ATGTATGCAC TCGGGAGTTT TCATCACAAG CGAGGCGCAA TCGGGTCGAA TATGGCACAG
GCCGCGCTCT GGTATCGCCG AGCGGCATCT GGTGGCCATA AGCATGCGCG GCGCATACTT
AGCCGGTCAT CGCCTTGTGG CACGGAGGAG GCGCCGTTAC GGGAGCTTAC ATGA
 
Protein sequence
MVSFRNIFDR NTYLPGNRLY GKGKRYLASD QIADWVKAAK CFLGAANRGR PEAQYQLGLC 
YLEGKGVPRN RAQALCRLTA AADAGIAEAM AKLSFLYLEG LPDFEDASHV ARLVKDERHG
RNNVDTDKAL YWANKAIAGG VAEGHVCLGY IYSLERSAYK DIDMAIKHYR AAMEAGSRKA
GLGLGQTLLR FRSAGLPALK EAEKALLFAM EEKSPVAAYL LGGLYEVGSG VEKDLSRSRE
LYRQAAEGGV IKAMMRLSTF LLDGHGGPPN RTAGETWLRR AALKGDWSAC RLLGEMYAAR
HHAYEMRRWY EIGAGHGDPV SIFRLGEAIE RLQEPGGTSI EAVECYLLAL QHGYEPAVRK
LAASVRNPGE FQARVLAKLN ALCAQDIPLA YLALALCIRS LKGGDMVMAR ELFIKASKGG
IVTAHVAAGQ MIMNGLGGKA DAGLAREFFF KGAKAGHIGA MYALGSFHHK RGAIGSNMAQ
AALWYRRAAS GGHKHARRIL SRSSPCGTEE APLRELT
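
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is one of several alternative start codons and is read as methionine at the first position. The following is a minimal sketch of such a translation, not the pipeline that produced this record. The codon table is built from the standard NCBI amino-acid string for table 11, the start-codon set is the one NCBI lists for that table, and `clean` and `translate_cds` are hypothetical helper names.

```python
from itertools import product

# NCBI translation table 11: bases ordered T, C, A, G; third position varies fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}
# Codons that NCBI lists as possible initiators for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS; an alternative start codon at position 1 is read as M."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGTTTCTT TTCGAAACAT TTTCGACAGA"))  # prints: MVSFRNIFDR
```

The output matches the first ten residues of the protein sequence listed above.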