Gene Ddes_1888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDdes_1888 
Symbol 
ID7285603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 
KingdomBacteria 
Replicon accessionNC_011883 
Strand
Start bp2276364 
End bp2277404 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content62% 
IMG OID643582709 
ProductSel1 domain protein repeat-containing protein 
Protein accessionYP_002480462 
Protein GI220905150 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.15133 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCC GGTTAGTTAT CATAGTTTTT CTGGCATTTT TGTGCCTTTC GCCTTTGTGG 
CTGTCACAGG GCGACGCCCG CGCTGATGAT GGGGAAAGCC TGCGCCAGGT GCAGACGGCA
CTGGGCAAAA ACGATTACGA TGAGGCCGTG CGCCTGCTCA AGCCGCTTGT TGACGGCGGC
AATGCTGAAG CCCTGTACGT TATGGGCCGT CTTATTCTGG ACGGCAAGGG CGTGAAGAAA
AACCGCACCC GTGCGGCGGA GTTTTTTCGC CTGGCTGCGG AAAAGGGCGA CGTGAGCGCC
ATGAACTCCT GGGCCACAGC CTTGGCCTCG GGCGACGGTG TGCCGCGCAA CTACCGTGAG
GCCGCGCGCT GGTTCCGCAA GGCGGCCGAA CAGGGGCTGG CCATGGCCCA GTACAACCTT
GGTTACCTCT ACGCCCACGG GCGCGGCGTC AGCAAGGATG AGGCCGCCGC CATTGACTGG
TACAGCCGTG CCGCCAATCA GGGCCTTGCA TCGGCCCAGT ATTCCCTGGG CTGGACCTAT
CTGAACAGCA AGGGTGAAAA CCAGAGCGAC ACCAAAGCCG CCCACTGGTT TGAAAAAGCC
GCGGAGCAAG ATCACCCCAA GGCGCAGAAC AATCTGGCAT TCATGTACGC CGAGGGACGG
GGCTATGCCC AGGACCCGGC CAAGGCCGTG CAGTGGTACA CACGCGCTGC CGAACAGGGC
TATGCCGAAG CCCAGTATAA CCTTGGCTTT ATGTACGAAC AGGGCCGCGG CGTGCCGCAG
GACTATAACC AGGCCGTGGA CTGGTACCGT AAGGCTGCGG AGCAGAACGA GGCCGCCGCG
CAGTACAGCC TGGGACTCAT GTATGATCAG GGAACCGGCG TGCCGCGCAA TCTGAGCGAG
GCCAACCGCT GGTACAATCT GGCCGCCAAG AATGGCGACC CCGATGCCCG ATCCGTGGTG
CGCGCCCAGA ACAACAAGCC GCAGCAGGCG CGCAAGGCCG CTCCGGCAAA CCGGCAACAG
AAGCGCGATA AAAAGCAGTA G
 
Protein sequence
MKIRLVIIVF LAFLCLSPLW LSQGDARADD GESLRQVQTA LGKNDYDEAV RLLKPLVDGG 
NAEALYVMGR LILDGKGVKK NRTRAAEFFR LAAEKGDVSA MNSWATALAS GDGVPRNYRE
AARWFRKAAE QGLAMAQYNL GYLYAHGRGV SKDEAAAIDW YSRAANQGLA SAQYSLGWTY
LNSKGENQSD TKAAHWFEKA AEQDHPKAQN NLAFMYAEGR GYAQDPAKAV QWYTRAAEQG
YAEAQYNLGF MYEQGRGVPQ DYNQAVDWYR KAAEQNEAAA QYSLGLMYDQ GTGVPRNLSE
ANRWYNLAAK NGDPDARSVV RAQNNKPQQA RKAAPANRQQ KRDKKQ