Gene Suden_1006 details


Gene Information

Locus tag: Suden_1006
Symbol:
ID: 3763908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfurimonas denitrificans DSM 1251
Kingdom: Bacteria
Replicon accession: NC_007575
Strand:
Start bp: 1056955
End bp: 1058235
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: TonB-dependent receptor
Protein accession: YP_393519
Protein GI: 78777204
COG category:
COG ID:
TIGRFAM ID:
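
The length fields above follow from the coordinates. A minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record above.
start_bp = 1056955
end_bp = 1058235

# Assumption: both endpoints are included in the feature.
gene_length = end_bp - start_bp + 1

# A CDS length should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is the stop codon and yields no residue.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed Gene Length and Protein Length.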


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.711145
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGATTTTAA TAGATGGCAA GAGAATAAGT GCTAGTGATT CGCTTATTGG GATGAGTGAT 
TTTGCATATA GCTGGGTTCC TCTTAATGCC ATAAAAAAGA TAGAAGTCAT CAAAGGTCCT
ATGAGCTCAC TTTATGGCTC TAGTGCAATC GGCGGGGTGG TAAATATCAT TACAAAAAAA
CCAACAGATA GTTTTTTTGG AGAAGTTGAT GCAAAGTACG GCTTTAGCAG TGCAGATGGC
GGAGATGAAC AAGACTACTC AATAAATGTT GGAGGCAAGA TAACCGACAA ACTCTCAGCA
ACTATTTTTG CTCAAGCAAT TAAGATAGAG CCTATGAAAG ATGGCACAAA TGTTTTAGCC
AAAAGAGAGG GCAGGGATGT TAAAAATGCT ATTTTAAACC TCTGGTACGA TATAGATGAC
TCACAACAAA TAGCTATCTC ATCCATGAGA GGAGATGAGA TTAGAGATAA TCTGAAATAC
AAAGAGTACT ACGATATTGA AAAAAGTCAT GACTCCATAG AGTATAGAAA ATATTTTGAT
AATGTAAAAA TGAATCTAAA ATATTACATG ACATCACTAG ATGCTCACTC GGATGATACA
AGTCTGCTCT ATACACATAA GATGGATGAT GAGGTCATGA ATGCGGAGTT TGCAATAAGC
GCGATTGATG ATAACTACAT CATACTCGGC GCAGAGAGAA GAGTTGAGAA GTATCATAAA
GCCTATGATT TTACTCCTGC AAAAGATTTT AAAGATGAGA TAGATTACAC ATCGTTTTAT
CTTCAAGATG AGATAGCAGT TGGAGAAAAA ACTCTTCTTA CTCTTGGTGC AAGATATGAC
AAGCATGAGA AGTTTGGAGC AGAGCTAAGC CCAAAAGCAA ATCTTGTTTA TAAACTTGAT
GAATATAACA GATTAAAAGG CGGCTACGGA CATGGCTTCA ACGCTCCCTC TTTAACTCAA
AACTCAGATG ATTATGTCCT TGCATACCCC ATAAATACCT CTTCTATGCC TATGCAGTTT
TACAGATTTA GGGGAAATAG CGCTCTTAAG CCAGAAGTCT CCGATAGCTT TGAAGTTGGG
TATGAGTATG CAAGAGATAC AACCTCATTT AAAGCAACTG TTTTTTATAC AAAAGTGAGT
GATCTCATTA CCTATAAAGA TAACGGAACA ACTGTAGCAA TGCCTATAGC ATATCAGGAA
AAACTATATT CAAATGTTGA TGAAGCGGCT ATTTATGGGC TTGAAGTGGA GTATGAAGAG
AAGGAGATTT TATCTAATTA A
 
Protein sequence
MILIDGKRIS ASDSLIGMSD FAYSWVPLNA IKKIEVIKGP MSSLYGSSAI GGVVNIITKK 
PTDSFFGEVD AKYGFSSADG GDEQDYSINV GGKITDKLSA TIFAQAIKIE PMKDGTNVLA
KREGRDVKNA ILNLWYDIDD SQQIAISSMR GDEIRDNLKY KEYYDIEKSH DSIEYRKYFD
NVKMNLKYYM TSLDAHSDDT SLLYTHKMDD EVMNAEFAIS AIDDNYIILG AERRVEKYHK
AYDFTPAKDF KDEIDYTSFY LQDEIAVGEK TLLTLGARYD KHEKFGAELS PKANLVYKLD
EYNRLKGGYG HGFNAPSLTQ NSDDYVLAYP INTSSMPMQF YRFRGNSALK PEVSDSFEVG
YEYARDTTSF KATVFYTKVS DLITYKDNGT TVAMPIAYQE KLYSNVDEAA IYGLEVEYEE
KEILSN
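
The protein sequence can be related to the gene sequence through translation table 11, which is what the record lists. The sketch below is one possible way to check this, not the database's own pipeline. It assumes the standard codon assignments used by table 11, treats the first codon as methionine when it is one of the table 11 initiation codons (this gene starts with TTG), and stops at the first stop codon. The function name `translate_table11` is hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); table 11 uses the
# standard assignments for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for table 11; at position 1 they encode Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str) -> str:
    """Translate a coding sequence, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codon read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_table11("TTGATTTTAA TAGATGGCAA GAGAATAAGT"))
```

Pasting the full gene sequence (spaces and line breaks included) into `translate_table11` should allow a direct comparison with the protein sequence shown above.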