Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1970 |
Symbol | |
ID | 5733859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2415151 |
End bp | 2417355 |
Gene Length | 2205 bp |
Protein Length | 734 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279114 |
Product | hypothetical protein |
Protein accession | YP_001544741 |
Protein GI | 159898494 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCTTG GTCGTTCATG GATGATCATC GTTGCAGTTA TGATCATTGC GCTGCTCAGT GTGATCACTG CAACCGCCAT GGTCAGCCCC AGTGTCAGCT CATCGGATGG GCTTTGGCAG AATGTTGCTG AACAGGACAT TCAACAAAAA GGCTCACGCG AAATCATTCC AGTAGTGTAT CGCACGGTCG CCTTGGATCT TAATTTGCTA CAACAACACT TACGTCAAGT GCCTCAAGAA GCCCAAACCA AGGTGCAAAA ATCTGGCTTT ATGTTAGATT TACCGCTACC AGATGGCCAA TTTGGCAAAT TTCGCGTCGT CGAATCGCCA ATTATGGCTC CCGAATTAGC CGCCAAGTTT CCTGAAATTC GCACCTTCTT GGCCCAATCA GTTGATCAGC CTGCAACCAG CGCTCGGCTT GATATCACGC CACGCGGCTT TCATGGCATG ATCTTGAGCG AATCAGGTCG GATTTTTATC GATCCATATA GCCGCAATGA TACGGCTAAC TATATTGTGT ACGATGCTCG CAATTTTGTG GCCGACCCTA GCAAATTGGC CGAACGGACT GGCAACGATT ACGAGCCAAA TCCATTAGGA AATCCATCGT CGATCATTCC TGAACGCTAC TCGATTGGTG AAACCTTGCG CACCTATCGC TTGGCCATGG CTGCCACTGG CGAATACACC GCATTCCACG GTGGCACGGT CAATGGCGCG ATGGCGGCAA TCGTCACCAG CATGAATCGG GTTAACGGAA TCTACGAACG CGATCTTTCA GTGCGCATGC AATTAATCGC CAACAATGAT CTGATTGTGT ATACCAATGC GAGCAGCGAC CCCTATACCA ACAATAGTGG TGGTACGATG CTTGGCCAAA ACCAAACCAA TTTGACCAAC GTGATTGGCG GGGCCAACTA TGATATTGGC CACGTGTTCA GCACTGGCGG CGGTGGGGTC GCTACTTTAC AATCAGTCTG TTCTTCAGGC AGCAAAGCGC GTGGGGTTAC TGGCTCAGGC TCACCAGTTG GCGATGCCTT CGATGTCGAT TATGTCGCGC ACGAAATTGG TCACCAATTT GGTGGTCTTC ACACCTTCAA TGGCTCAACT GGCAGTTGTA GTGGTGGCAA TCGTTCCAGC AGCGCTGCCT ACGAACCAGG TAGTGGTACA ACCATTATGG CTTATGCCGG GATTTGTGGC TCGGAAAACT TGCAACCAAA CAGCGACTTC GACTTCCACG TCAAGAGCTT AGAAGAAATT TCAGCCTTCA TCACCACTGG CGGCGGCGCA ACCTGTGGCA CAACTCAAGC CACTGGCAAT ACCCCACCAG TCGCCAATGC AGGCAGCGAT TACACCATCC CAGCCAATAC GCCATTCGAA TTGACTGCTA GCGCTAACGA TGCTGAAGAC AATAGTTTGA CCTACGATTG GGAACAATAC GATTTGGGTG CGGCCTCACC ACCAAACACC GATAACGGCA ATCGCCCAAT CTTCCGTAGC TTCAATTCAA CCGCCAGCAA TGTGCGTACC TTGCCAAAAC TGAGCGATAT TTTGAACAAT ACCACGACGA TTGGCGAATC GTTGCCAACT ACTAACCGCA ACCTGACCTT CCGCTTGACC GTGCGTGATA ACCACGCTGG GGCTGGTGGT TATGGCTTGG ATACGGCGGT TCTGACTGTC AACAATACTG CTGGACCATT CTTGGTAACT GCACCAAACA CGGCAATCGC CTGGACAGGC GGCGCGAACG AGTCGGTCAC GTGGAATGTT GCCAACACCA CCGCTGCGCC AATTAGCTGT GCTAATGTTG ATATTTTGCT CTCGAAAGAT GGCGGCACAA GCTTTGAAGC CTTGGTCAGC AACACCCCCA ACGATGGCGA TGAAACTGTT GTTGCGCCAA ACGTTAATGC TGCTGCTGCG CGGATCAAAG TGCGTTGTGC TAACAATATC TTCTTTGATA TTTCTAACGC CAACTTTGCG ATCAACGGGG TCAATATCAC ACCAACTCCA GTTACACCAA CATTAACCCC AACCAATACT CCCACTCGCA CGCCAACATT AACCCCAACT CAAACATCAA CCCCAACCCA AACCGCGACG GCGACCCCAA CAGCTAGCCC AACCGTCAGC GTTACGCCGA CGGCGAGCGT TACACCAACC CCTGAGAACT ATAGCGTTTA TCTGCCTGTC GCAATCAAAA ATTAA
|
Protein sequence | MRLGRSWMII VAVMIIALLS VITATAMVSP SVSSSDGLWQ NVAEQDIQQK GSREIIPVVY RTVALDLNLL QQHLRQVPQE AQTKVQKSGF MLDLPLPDGQ FGKFRVVESP IMAPELAAKF PEIRTFLAQS VDQPATSARL DITPRGFHGM ILSESGRIFI DPYSRNDTAN YIVYDARNFV ADPSKLAERT GNDYEPNPLG NPSSIIPERY SIGETLRTYR LAMAATGEYT AFHGGTVNGA MAAIVTSMNR VNGIYERDLS VRMQLIANND LIVYTNASSD PYTNNSGGTM LGQNQTNLTN VIGGANYDIG HVFSTGGGGV ATLQSVCSSG SKARGVTGSG SPVGDAFDVD YVAHEIGHQF GGLHTFNGST GSCSGGNRSS SAAYEPGSGT TIMAYAGICG SENLQPNSDF DFHVKSLEEI SAFITTGGGA TCGTTQATGN TPPVANAGSD YTIPANTPFE LTASANDAED NSLTYDWEQY DLGAASPPNT DNGNRPIFRS FNSTASNVRT LPKLSDILNN TTTIGESLPT TNRNLTFRLT VRDNHAGAGG YGLDTAVLTV NNTAGPFLVT APNTAIAWTG GANESVTWNV ANTTAAPISC ANVDILLSKD GGTSFEALVS NTPNDGDETV VAPNVNAAAA RIKVRCANNI FFDISNANFA INGVNITPTP VTPTLTPTNT PTRTPTLTPT QTSTPTQTAT ATPTASPTVS VTPTASVTPT PENYSVYLPV AIKN
|
| |