Gene Athe_1714 details

Gene Information

Locus tag: Athe_1714
Symbol:
ID: 7409229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1803490
End bp: 1804674
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 38%
IMG OID: 643716090
Product: cysteine desulfurase NifS
Protein accession: YP_002573581
Protein GI: 222529699
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID: [TIGR03402] cysteine desulfurase NifS
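
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1803490
end_bp = 1804674

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1185 394
```

Under these assumptions the result agrees with the listed 1185 bp and 394 aa.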


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 3.84465e-07
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGGAA AGATTATTTA TTTTGACCAT GCAGCCACAA CCCCTCTTAA AAAGGAAGTA 
TTAGATGAAA TGATGCCGTA TTTGACAGAT CAGTACGGCA ATCCTTCAAC AATTTACAAG
CTTGGAAGAG AAGCAAAAAA AGCCATTGAA CTTGCAAGAG AAAGGGTCGC AAAGGCCTTA
AATGCTGATA TTCAAGAAAT TTACTTTACT TCCGGTGGAA CAGAATCAGA TAACTGGGCA
TTAAAAGGAG TTGCTTTTGC AAATAAAGAT AAAGGCAAGC ATATTATAAC AACAACAATC
GAGCACCATG CGGTTTTGCA TCCTCTAAAA TATCTTGAAG GTTTAGGATT TGAAGTAACA
TATGTTCCTG TTGAGCCAAA TGGTATTGTA GACCCTCAAA AAGTCAAAGA GGCAATAAAA
AATGGCACTA TTTTGATTTC TGTCATGCTT GCAAATAACG AAATTGGGAC AATCCAGCCT
GTCAAAGAGA TAGCAAAGAT AGCAAAGGAA AAGGGAATAA TCGTTCATAC TGATGCTGTT
CAAGCAGTTG GGCAAATTCC TGTTGATGTA AAAGATTTGG GTGTCGACCT TTTATCACTT
TCTGCTCATA AATTCTATGG GCCAAAAGGT GTTGGTGCAC TTTATATCAG AAAAGGGACA
AAGATTCATC CATTTTCGCA TGGAGGTGCA CAGGAGAAAA ATAGGCGTGC TGGAACAGAG
AATGTAGCAG GGATTGTTGG ACTTGGCAAG GCTATAGAGC TTGCAACTCA GAATCTTTCT
GAGTATGCTG CAAAGCTTCA AAAACTGAGA GATAAGCTCA TTGACGGGGT TTTAAGCAAA
ATTGATTATG TTCGACTAAA TGGTGATAGA CATCAGAGAC TTCCTAACAA TGCAAACTTC
TCATTTGAGT TTATTGAAGG TGAAAGCCTG CTTTTGATGC TTGACATGAA AGGAATTGCA
GCATCAAGCG GGTCAGCATG TACATCAGGG TCTTTGGACC CTTCACATGT GCTTCTGGCA
ATTGGACTTG AACATGAGGT TGCTCATGGA TCTTTGAGAA TAACACTTGG TGAAGATAAC
ACCGAAGAAG ATATAGATTA TCTATTAGAA GTTTTGCCTG AAATTGTTTC AAGATTAAGA
GAAATGAGTC CACTTTATGA AAGCGTAAAA AAAGGGGGTA ATTGA
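
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation such as GC content. The following is a minimal sketch of that clean-up; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any tool behind this record, and the sample is only the first line of the sequence rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a small sample.
sample = clean_sequence(
    "ATGGAAGGAA AGATTATTTA TTTTGACCAT GCAGCCACAA CCCCTCTTAA AAAGGAAGTA "
)
print(len(sample), round(gc_percent(sample), 1))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content listed in the record.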
 
Protein sequence
MEGKIIYFDH AATTPLKKEV LDEMMPYLTD QYGNPSTIYK LGREAKKAIE LARERVAKAL 
NADIQEIYFT SGGTESDNWA LKGVAFANKD KGKHIITTTI EHHAVLHPLK YLEGLGFEVT
YVPVEPNGIV DPQKVKEAIK NGTILISVML ANNEIGTIQP VKEIAKIAKE KGIIVHTDAV
QAVGQIPVDV KDLGVDLLSL SAHKFYGPKG VGALYIRKGT KIHPFSHGGA QEKNRRAGTE
NVAGIVGLGK AIELATQNLS EYAAKLQKLR DKLIDGVLSK IDYVRLNGDR HQRLPNNANF
SFEFIEGESL LLMLDMKGIA ASSGSACTSG SLDPSHVLLA IGLEHEVAHG SLRITLGEDN
TEEDIDYLLE VLPEIVSRLR EMSPLYESVK KGGN
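
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free illustration. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons, which is not handled here), and the name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Spaces and line breaks are ignored. Codons containing characters
    other than A, C, G, T raise KeyError in this simple sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence above.
print(translate("ATGGAAGGAA AGATTATTTA TTTTGACCAT"))  # MEGKIIYFDH
print(translate("AAAGGGGGTA ATTGA"))                  # KGGN
```

Both fragments match the corresponding start and end of the protein sequence shown above.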