Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3597 |
Symbol | |
ID | 5735458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4527168 |
End bp | 4528388 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280746 |
Product | SH3 type 3 domain-containing protein |
Protein accession | YP_001546361 |
Protein GI | 159900114 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.400184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCAAG TTGCCTTAAC ACCACGTTCT CAGCCAACGA CTGTTTGGCA GTTACGGCGC AAAAAAGTCC CCGATCAACT TTCGTTGCCT GCTGATTATG TTCCAACTGA CTATACGTCG TTGCCAACTC CCGAAGAGCC AAAAAAAGGT GGCGGTCTAG GGATGATTTT GGTTGTGGTG GTCTTGTTCT TGGCGTTGGG GGCAATTGCC TATGCTGTTT GGGGGCAGGG TAGCGGCGAG GAAGATGTTG TTCCAACCAA AGTGCCAACC ACCGCGCTAA CGATTGATCG GGCTAGTATG ATCGTGAGCG ATGGCAAATT GACGGCTCAA ATCTCCATTA CCACAGATGC ACCTGATGAG AGTCAAGTGG GAGCGATCTT ATTGGAAGAT GGCCGCCCAT TTGAATTTTT TGATACCGAT GCCATGACCA CCACTGTCAG CGGCGGCAAG GCTCGCTTGA TTATTCCTGA AATGGAAGGT CACGAAGATG GCCGCAAATC GTCGGAATAT ACTGTCCAAG TAACCGTTGC GAGCGCTGAT GGCGATGTTC TAGCCAAAGC CGACGAAGCG ATTGAAATCA AGGGCGATGC GCTTGATCGT TTCTTGGGCG ATACCGCAGT AACTCCAATC GATGTAACCC CAACCATCAC TGATCCAATT TCAGGCACGA TGGTACCAAC GCCTGCTGAT GGCTCGACTC CAGTAACCCA AGTTACGCCA GTTCAGCCAA CTGCCGCACC TGGTGGCCTG CCCGTGCCGT TGGATAATGT GGTGATCAGC CGCCCAGGCA TTGTCTATAC CACGCCGTTT GGCCCAGCCA ACCAACGTGG CAACGTAACT GCTGGCGAAA TTGCGCGAAT TGTCGTCAAG ATGCCAGTGA ATGGTGAAGT TTGGTATTTG GTCGCGATCA GCCAAAGCGG TCAATCAGGC TGGCTCAATA GCAGCACGAT TGACTTACCT GCAACTGAAG TCAACAAAAT TACCCCTGTT AGCGGCGATG CACCATTTGC CGTAGCCTTC AATGGCGGTA ATGTGCGCTC AGCACCTGGG GGCGATGTTT TGACCCAAGT TGATGCTGGG GTCAATGTCT CGCTGATCAA CCGCAGCAGC GATAGCGCTT GGTTCAAAAT CAAGTTACCA AATGGTAGTG AAGGATGGGT CGTTGGTCAG ATCTTGACTA TCAACCCTGC GGTATTAAAT ACCATCCCTG TAGCACCCTA A
|
Protein sequence | MSQVALTPRS QPTTVWQLRR KKVPDQLSLP ADYVPTDYTS LPTPEEPKKG GGLGMILVVV VLFLALGAIA YAVWGQGSGE EDVVPTKVPT TALTIDRASM IVSDGKLTAQ ISITTDAPDE SQVGAILLED GRPFEFFDTD AMTTTVSGGK ARLIIPEMEG HEDGRKSSEY TVQVTVASAD GDVLAKADEA IEIKGDALDR FLGDTAVTPI DVTPTITDPI SGTMVPTPAD GSTPVTQVTP VQPTAAPGGL PVPLDNVVIS RPGIVYTTPF GPANQRGNVT AGEIARIVVK MPVNGEVWYL VAISQSGQSG WLNSSTIDLP ATEVNKITPV SGDAPFAVAF NGGNVRSAPG GDVLTQVDAG VNVSLINRSS DSAWFKIKLP NGSEGWVVGQ ILTINPAVLN TIPVAP
|
| |