Gene Information |
Locus tag | Haur_3884 |
Symbol | |
ID | 5735745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4876784 |
End bp | 4877980 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281035 |
Product | aminotransferase class V |
Protein accession | YP_001546646 |
Protein GI | 159900399 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR03402] cysteine desulfurase NifS |
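
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene coordinates include the terminal stop codon (consistent with an unspliced CDS whose sequence ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(4876784, 4877980)
print(length)                  # 1197
print(protein_length(length))  # 398
```

Both results agree with the Gene Length (1197 bp) and Protein Length (398 aa) fields listed in the record.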
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00981578 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCCGG AAACAATTTA TTTAGATCAT GCAGCGACGA CTGCGACCGA TCCAAGGGTG GTTGAGGCCA TGCTGCCCTA CTTCAACACG GCCTATGGCA ATCCCTCGAG CATCTATCGG CTGGGCCGCG CAGCGCTCGA AGGCGTAGAT GAAGCCCGCG AAACCGTTGC GAGTTTGCTA GGAGCAAAAC GCAAAGAAAT TGTGTTTACC AGCGGCGGCT CCGAAGCCGA TAATTTGGCG ATCAAGGGCG TGGCATTTGC TCAGCGTGAT GCAGGCAAAG GCAATCACAT CATCACCAGT GCCATTGAAC ATCACGCGGT GCTGCATGCA GTGGAATATC TCGAACACTT TGGCTTTGAA ATCACGATTT TGCCGGTCGA TAGCACGGGT TTGGTTGCCG TGGCCGATTT ACGGGCCGCG ATTCGCCCAA CCACGGTGTT GGTCAGCATT ATGGCCGCCA ACAACGAGAT TGGCACGATT CAGCCAATTG CCGAATTGGG CGCGGTTTGT CGCGAGCACA ATGTGCTGTT TCATACCGAT GCCGTGCAGT TGATCGGGGC GCAACCAATT AATGTTAAAG AATTGAATGT TGATTTGTTG AGCCTAACTG CGCATAAATT TTATGGTCCC AAAGGCGTAG GCGCGTTGTA TATGCGGCGC GGCGTACCCT TGCTACCGTT GATTAATGGT GGCTCACAGG AACGGCGGTT ACGCGCTGGC ACCGAAAATG TGCCTGGGAT CGTTGGGCTA GCCAAAGCCT TGCAACTTGC CGTCGATGAA TTGCCACAAA GCAGCAACCA ACTAACCAGC CTGCGCGATC GGCTGATTAG CGGAATTGAG GCAGCAATCC CGCATGTCTA TTTAAATGGC CATCGCAGCC AGCGTTTGCC CAATAATGTC AACATGTCGT TTGATTTTAT TGAGGGCGAA AGCATGTTGT TGTTGCTTGA TCAGCAGGGC ATTTATGCCT CGAGTGGCTC GGCCTGCACC AGCGGTTCGC TTGACCCATC GCATGTTCTG ATGGCCTTGG GCTTGAGTGC CGAACGCGCT CATGGCAGCC TGCGCATGAC CCTTGGCCGC GAGAACACCG CCGAGCAAAT CGAGCGCGTC TTAGCATTGT TGCCGCCAAT CGTCGAGCGC TTGCGGGCAG TTTCGCCGAT GTATCGCCAT TTCTTGGCGG AACAAACTGT TTATTAA
|
Protein sequence | MAPETIYLDH AATTATDPRV VEAMLPYFNT AYGNPSSIYR LGRAALEGVD EARETVASLL GAKRKEIVFT SGGSEADNLA IKGVAFAQRD AGKGNHIITS AIEHHAVLHA VEYLEHFGFE ITILPVDSTG LVAVADLRAA IRPTTVLVSI MAANNEIGTI QPIAELGAVC REHNVLFHTD AVQLIGAQPI NVKELNVDLL SLTAHKFYGP KGVGALYMRR GVPLLPLING GSQERRLRAG TENVPGIVGL AKALQLAVDE LPQSSNQLTS LRDRLISGIE AAIPHVYLNG HRSQRLPNNV NMSFDFIEGE SMLLLLDQQG IYASSGSACT SGSLDPSHVL MALGLSAERA HGSLRMTLGR ENTAEQIERV LALLPPIVER LRAVSPMYRH FLAEQTVY
|
| |
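
The gene and protein sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display string might be normalized and its GC fraction computed is given below; `join_blocks` and `gc_fraction` are hypothetical helper names introduced for illustration, and the sketch assumes the sequence contains only the unambiguous bases A, C, G and T.

```python
def join_blocks(display_seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(display_seq.split()).upper()


def gc_fraction(display_seq: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) DNA sequence."""
    seq = join_blocks(display_seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Example on the first two blocks of the gene sequence shown above.
first_blocks = "ATGGCTCCGG AAACAATTTA"
print(join_blocks(first_blocks))
print(round(gc_fraction(first_blocks), 2))
```

Applied to the full gene sequence, the same function would give a value that can be compared against the GC content field reported in the record.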