Gene Acry_2269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_2269 
Symbol 
ID5162353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp2499678 
End bp2500922 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content69% 
IMG OID640554188 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_001235383 
Protein GI148261256 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.216373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCGG CCATCGAGCA GCCCGGCTTC GACATCGCGC GCATCCGCGC CGATTTCCCC 
ATCCTGTCGC AGACGGTGCA TGGCAAGAAA CTGGTGTTTC TCGATAGCGG CGCCTCGGCG
CAGAAGCCGC GCGCGGTGAT CGATGCGATG GTCCGCTCGA TGGAGACACG CTACGCCAAT
GTCCATCGCG GGCTGCACTG GCTGAGCGAG CGCGCCTCGG ACGACTACGA GGCCGCGCGC
GACAAGGCGG CGGCTTTCCT TAACGCGGCG CGGGAGGAGA TCGTCTTCGT GCGCAACGCG
ACCGAGGGGA TCAACCTCGT CGCCGCGACC TTCGGCCGGT CGGCGCTGCG TCCGGGCGAT
GCGGTGGTGG TGAGCGAGAT GGAGCACCAC GCCAATCTCG TGCCCTGGCA GATGCTGCGG
GATTCGCACG GGATCGAGCT GCGCATCGCG AAGATCACCG ATGCCGGCGA GCTCGACTTC
GCCGATCTGG AGCGGCAATT CGCCGATGGG CGGGTGCGGC TGCTGGCGAT CACCCATATG
TCGAACGTGC TCGGCACCTA CACGCCGGTG GAGCGGCTTG CCGCCTTTGC GCACGAGCGC
GGGGCGCGGC TGCTGCTCGA TGGCGCGCAG GCGGTGGTGC ACCGGGCGGT GGACGTCCGT
GCGATCGATG CGGATTTCTA CGTGTTCTCC GGCCACAAGC TCTACGGGCC GTCCGGCATC
GGGGTGCTGT TCGGCAAGCG CGAGCTGCTC GACGCGATGC CGCCCTTCCT CGGCGGCGGC
GACATGATCC GCAGCGTGAG CTACGAAAAA TCGACCTGGG CGGAGCCGCC CTACCGGTTC
GAGGCGGGAA CGCCGGCGAT CGTCGAGGCG GTGGGTCTCG CGGCAGCGAT CGATTATGTG
AACGCGATCG GGTTTCCGGC GATCGCGTCG CATGAGCGGG CGCTGACCGA TCACGCGCTG
GCGACGCTCG ATGCGATCGG CGGCGTCCAT GTGGTCGGGC GGGCGCAGGA CCGCGGCGGG
GTCGTCGCCT TCACCATGGA CGGGGTGCAT GCGCACGACG TCGCCACCCT GCTCGACAAG
CAGGGGATCG CGGTGCGGGC CGGGCATCAT TGCGCCGAAC CGCTGACCCG CCGGCTCGGG
CTCGACAGCA CCGCCCGCGC GACGTTCGGC GTCTATACGA CGATGGAGGA GATCGACGCG
CTGGCGGCGG GGCTGCGGCG CGTGCAGCAG GTGTTCGGCG GATGA
 
Protein sequence
MSAAIEQPGF DIARIRADFP ILSQTVHGKK LVFLDSGASA QKPRAVIDAM VRSMETRYAN 
VHRGLHWLSE RASDDYEAAR DKAAAFLNAA REEIVFVRNA TEGINLVAAT FGRSALRPGD
AVVVSEMEHH ANLVPWQMLR DSHGIELRIA KITDAGELDF ADLERQFADG RVRLLAITHM
SNVLGTYTPV ERLAAFAHER GARLLLDGAQ AVVHRAVDVR AIDADFYVFS GHKLYGPSGI
GVLFGKRELL DAMPPFLGGG DMIRSVSYEK STWAEPPYRF EAGTPAIVEA VGLAAAIDYV
NAIGFPAIAS HERALTDHAL ATLDAIGGVH VVGRAQDRGG VVAFTMDGVH AHDVATLLDK
QGIAVRAGHH CAEPLTRRLG LDSTARATFG VYTTMEEIDA LAAGLRRVQQ VFGG