Gene Haur_3884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3884 
Symbol 
ID5735745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4876784 
End bp4877980 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content54% 
IMG OID641281035 
Productaminotransferase class V 
Protein accessionYP_001546646 
Protein GI159900399 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00981578 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCCGG AAACAATTTA TTTAGATCAT GCAGCGACGA CTGCGACCGA TCCAAGGGTG 
GTTGAGGCCA TGCTGCCCTA CTTCAACACG GCCTATGGCA ATCCCTCGAG CATCTATCGG
CTGGGCCGCG CAGCGCTCGA AGGCGTAGAT GAAGCCCGCG AAACCGTTGC GAGTTTGCTA
GGAGCAAAAC GCAAAGAAAT TGTGTTTACC AGCGGCGGCT CCGAAGCCGA TAATTTGGCG
ATCAAGGGCG TGGCATTTGC TCAGCGTGAT GCAGGCAAAG GCAATCACAT CATCACCAGT
GCCATTGAAC ATCACGCGGT GCTGCATGCA GTGGAATATC TCGAACACTT TGGCTTTGAA
ATCACGATTT TGCCGGTCGA TAGCACGGGT TTGGTTGCCG TGGCCGATTT ACGGGCCGCG
ATTCGCCCAA CCACGGTGTT GGTCAGCATT ATGGCCGCCA ACAACGAGAT TGGCACGATT
CAGCCAATTG CCGAATTGGG CGCGGTTTGT CGCGAGCACA ATGTGCTGTT TCATACCGAT
GCCGTGCAGT TGATCGGGGC GCAACCAATT AATGTTAAAG AATTGAATGT TGATTTGTTG
AGCCTAACTG CGCATAAATT TTATGGTCCC AAAGGCGTAG GCGCGTTGTA TATGCGGCGC
GGCGTACCCT TGCTACCGTT GATTAATGGT GGCTCACAGG AACGGCGGTT ACGCGCTGGC
ACCGAAAATG TGCCTGGGAT CGTTGGGCTA GCCAAAGCCT TGCAACTTGC CGTCGATGAA
TTGCCACAAA GCAGCAACCA ACTAACCAGC CTGCGCGATC GGCTGATTAG CGGAATTGAG
GCAGCAATCC CGCATGTCTA TTTAAATGGC CATCGCAGCC AGCGTTTGCC CAATAATGTC
AACATGTCGT TTGATTTTAT TGAGGGCGAA AGCATGTTGT TGTTGCTTGA TCAGCAGGGC
ATTTATGCCT CGAGTGGCTC GGCCTGCACC AGCGGTTCGC TTGACCCATC GCATGTTCTG
ATGGCCTTGG GCTTGAGTGC CGAACGCGCT CATGGCAGCC TGCGCATGAC CCTTGGCCGC
GAGAACACCG CCGAGCAAAT CGAGCGCGTC TTAGCATTGT TGCCGCCAAT CGTCGAGCGC
TTGCGGGCAG TTTCGCCGAT GTATCGCCAT TTCTTGGCGG AACAAACTGT TTATTAA
 
Protein sequence
MAPETIYLDH AATTATDPRV VEAMLPYFNT AYGNPSSIYR LGRAALEGVD EARETVASLL 
GAKRKEIVFT SGGSEADNLA IKGVAFAQRD AGKGNHIITS AIEHHAVLHA VEYLEHFGFE
ITILPVDSTG LVAVADLRAA IRPTTVLVSI MAANNEIGTI QPIAELGAVC REHNVLFHTD
AVQLIGAQPI NVKELNVDLL SLTAHKFYGP KGVGALYMRR GVPLLPLING GSQERRLRAG
TENVPGIVGL AKALQLAVDE LPQSSNQLTS LRDRLISGIE AAIPHVYLNG HRSQRLPNNV
NMSFDFIEGE SMLLLLDQQG IYASSGSACT SGSLDPSHVL MALGLSAERA HGSLRMTLGR
ENTAEQIERV LALLPPIVER LRAVSPMYRH FLAEQTVY