Gene Information |
Locus tag | Haur_3042 |
Symbol | |
ID | 5734914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3842848 |
End bp | 3843999 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280186 |
Product | hypothetical protein |
Protein accession | YP_001545808 |
Protein GI | 159899561 |
COG category | [C] Energy production and conversion |
COG ID | [COG1600] Uncharacterized Fe-S protein |
TIGRFAM ID | [TIGR00276] iron-sulfur cluster binding protein, putative |
|
|
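The length fields in the record above follow from the coordinates. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed 1152 bp) and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # The stop codon occupies one codon but adds no residue.
    return codons - 1 if includes_stop else codons


length = gene_length_bp(3842848, 3843999)
print(length)                     # 1152
print(protein_length_aa(length))  # 383
```

Both values agree with the Gene Length and Protein Length rows of the record.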
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00069817 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAACA ATCTTGCTGA GGCAGTTCGA GCACAGGCTG CTCAATTAGG TTTTAATCTG GTCGGTATCA CGCCTGCCAA GCCATCACCA ACTTTAGCTG CTTATCAAGC TTGGGTTGAG GCCGGAATGT ATGGCGAGAT GGGCTATTTG GCGCGGCCTG ATCGCCAAGC GCGTCGCCAA GATTTGAATG TGATTGTGCC GAACGTGCGT TCGTTGATTA TCGTGGCGCT GGATTATCGC ACTCAGCCGA TTCCTGCCAG TTTGCTGAAT GACCCCAGCC GTGGCCGAAT TGCGGCCTAT GCTTGGGGCA TCGATTACCA TGATCTGATG ACCCCACGGT TACAAGAACT GGCGAATTGG CTGCAAGCCC AAATTGCCGA GCCAGTTCAG CAGCGGGTCT ATGTCGATAC TGGGGCGATT TTAGAGCGTT CGCATGCCCA GCAGGCTGGC TTGGGCTTTA TCGGCAAAAA TACCATGCTG ATTAGCCCAC GGCGTGGCTC CTTCTTCTTC TTGGGCGAAA TTCTGACGAC CTACGAATTT GCTGATTACG ATCAGCCTGC CCCGCCGACG ATGTGTGGCT CGTGTAGCCG CTGTTTGCAA GCCTGCCCAA CCAAGGCCTT CCCCAAACCG CATGTGCTCG ATGCTCGGCG CTGCATTTCC TACCTGACAA TTGAATATAA AGGCTCAATT GCGCGTGAAC TCCGCCCGCA AATGGCCAAT TGGATTGTTG GTTGTGATGT TTGTCAGGAT GTTTGTCCGT GGCAGCGTTT TGGTGTACAA AGCCAAGAAT CAGCCTTTTT TCCAATTGAT CATGATCGGG CAGCTCCGCC CTTGGCCAGC CTTTTGACCT TAGATCCTGC TGGTTTTGGT GAGCGCTATG GCGAGGCCGC GATTAGTCGG CTCAAGCGTG ATCGTTTAGT GCGCAACGCC TGTGTGGCGG CAGGCAACTG GCGCGACCCC GCAATTTTGC CCTTGCTAGC GCCCTTGTTG CACGATGCCA GCAGCCTTGT GCGTGAGCAT GCCGCTTGGG CGATTGGCCG CAATTTCGAT GATTCATCTG TGATGTTGCT GCAACAAGCC TTGCAAACTG AAACCGAGCC AAGCGTGCGT AACGAATTGC AACAGAGTTT GCACGAGCGC TCAGCTTGCT AG
|
Protein sequence | MTNNLAEAVR AQAAQLGFNL VGITPAKPSP TLAAYQAWVE AGMYGEMGYL ARPDRQARRQ DLNVIVPNVR SLIIVALDYR TQPIPASLLN DPSRGRIAAY AWGIDYHDLM TPRLQELANW LQAQIAEPVQ QRVYVDTGAI LERSHAQQAG LGFIGKNTML ISPRRGSFFF LGEILTTYEF ADYDQPAPPT MCGSCSRCLQ ACPTKAFPKP HVLDARRCIS YLTIEYKGSI ARELRPQMAN WIVGCDVCQD VCPWQRFGVQ SQESAFFPID HDRAAPPLAS LLTLDPAGFG ERYGEAAISR LKRDRLVRNA CVAAGNWRDP AILPLLAPLL HDASSLVREH AAWAIGRNFD DSSVMLLQQA LQTETEPSVR NELQQSLHER SAC
|
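The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It assumes the gene sequence is already given in coding orientation (as shown it begins with ATG and ends with TAG, even though the strand is "-"), and it uses the codon assignments of NCBI translation table 11, which match the standard code for amino acids. The helper names are hypothetical, and alternative start codons are not treated specially.

```python
from itertools import product

# NCBI codon ordering (T, C, A, G); the amino acid assignments of
# translation table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
fragment = "ATGACGAACA ATCTTGCTGA GGCAGTTCGA"
print(translate(fragment))  # MTNNLAEAVR
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` allows the GC content and Protein sequence rows to be checked in the same way.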