Gene Noc_2487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2487 
Symbol 
ID3704372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2832448 
End bp2833710 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content49% 
IMG OID637738966 
Productcysteine desulphurases, SufS 
Protein accessionYP_344470 
Protein GI77165945 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01552] prevent-host-death family protein
[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATTG CAAACATCGC AGCTGAAGTA CCTATAGCAA GTGTCCCTTT TGAGATTGAG 
CGGGCCCGCG CTGACTTCCC TGTTTTGCAA CAGGAAGTAC ATGGGAAGCC TTTGGTGTAT
CTAGATAATG CGGCGACTAT GCAAAAACCT AGACAAGTTA CTGAGGCGAT TGATCACTAT
TACCGTTGGG ATAATGCCAA TATCCACCGT GCTGTGCATC AACTTAGCGA ACGGGCGACC
CAAGCATACG AGGCGGCGAG GAATAAAGTG CAACACTTTA TTAACGCTGC TCGGCGGGAA
GAAATTGTGT TTGTACGGGG CACTACCGAA GCAATCAATT TAGTCGCCCA AAGCTTCGGC
CGAAGCCGGT TGCAAGCGGG TGATGAAATT CTGCTCTCCC ATATGGAGCA TCACTCTAAC
ATTGTGCCGT GGCAGCTCCT ATGCGAGCAG ACGGGGGCAG TGCTTAAAGT CGTTCCTATC
GATGATACCG GGGAATTGCT TTTAGATGAG TACGAGAGGT GTTTGTCACC TCGCACCCGG
CTGGTAGCCA TGGTGCATGC TTCTAATGTT TTGGGAACTA TCAATCCAGC ACAAAAAATC
ATTGAGCTAG CCCATGCCCG TGGGATTCCG GTGTTGCTAG ATGGGGCCCA GACTGTTCCC
CACATGCCCG TGGATGTCCA GGAGCTGGAC TGTGATTTTT ACGCTTTTTC AGCTCATAAA
ATGGTAGGAC CCACAGGTAT TGGCGTGCTC TATGGCAAGC GCGAATGGCT AGAAGCTATG
CCTCCTTATC AGGGCGGCGG AGATATGATT CTGTCGGTAA GTTTTGATAA AACGCTTTAT
AGCGATCTCC CCTATAAGTT CGAGGCGGGA ACTCCTCATA TCGCTGGAGC CATTGGCCTG
GGTGTAGCGA TAGATTATCT AGAGACCCTA GGGATGGAAA ACATTGCGGC CTATGAGCAG
GAATTGCTTA ACTATGGGAC AGAGGTTCTG GCTCAAGTTC CCGGACTGCG TTTTATTGGC
ACAGCCCAAG AAAAAGTAGG GGTATTATCC TTTGTCTTGG AGGGCGTCCA TCCTCATGAT
ATCGGTACCA TTCTTGATCA TGAAGGTATT GCCATTCGCA CAGGACATCA TTGCGCCCAG
CCGGTAATGG AACGTTTTAA TATTCCCGCA ACGGCAAGAG CCTCCCTAGC ATTTTACAAT
ACTAAGGCTG AAATGGATGT TCTGGCAGCT GGAATTAAGC GGGTAGGAGA GTTATTGGGA
TAA
 
Protein sequence
MTIANIAAEV PIASVPFEIE RARADFPVLQ QEVHGKPLVY LDNAATMQKP RQVTEAIDHY 
YRWDNANIHR AVHQLSERAT QAYEAARNKV QHFINAARRE EIVFVRGTTE AINLVAQSFG
RSRLQAGDEI LLSHMEHHSN IVPWQLLCEQ TGAVLKVVPI DDTGELLLDE YERCLSPRTR
LVAMVHASNV LGTINPAQKI IELAHARGIP VLLDGAQTVP HMPVDVQELD CDFYAFSAHK
MVGPTGIGVL YGKREWLEAM PPYQGGGDMI LSVSFDKTLY SDLPYKFEAG TPHIAGAIGL
GVAIDYLETL GMENIAAYEQ ELLNYGTEVL AQVPGLRFIG TAQEKVGVLS FVLEGVHPHD
IGTILDHEGI AIRTGHHCAQ PVMERFNIPA TARASLAFYN TKAEMDVLAA GIKRVGELLG