Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_1182 |
Symbol | |
ID | 6743999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 1092060 |
End bp | 1093805 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642750991 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002121845 |
Protein GI | 195953555 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATTTTG CTTTTAAAAT ACTCAAAAAT CCCTACATTA ACGGTATTTT ATTTACGCTC ATATGTTTTT TGCCTATATA TTTTTCCAAA AGCCCAAAAG TCTCCAACAT ATCTATAAAA ACTGTAGACC TAAGAAAAGC TCATATATCC ATTGGCAAAA CATCTAACAT CGCTTATATC GTTTTAGCTT CAGATCCCAA AACCCTAAAC CCAGTACTAG CTCAAGAGAC ATCTTCCACC GATGTAATCG CCCCTCTTTT TAACGGCCTT ACAAAGATTG ACTTAAAAAC TATGAGCATA AAACCAGAAC TCGCCTCTTC TTGGAAAATA CTAAATAACG GAAAAACCTA CATAATATAT CTTAGAAAAG GTTTAAAATG GTCCGATGGA AAACCCCTAA CCGCTTACGA TGTAGAGTTT ACCTACAACG ATATATATTA TAACCCTCAT ATTCCAAATT CTATAAAAGA TACGTTATCA GTGGATGGTA AACCTTTCAA GGTAAAAGCG TTAAACAAAT ACACTGTAGA GTTTGATTTG CCTCACCGGT TTGCCCCCTT TCTACAATCT ATAAGCGCTC CCATACTACC AAAGCATATT TTAGAAAACG CTGTAAAACA AAATACCTTT AACACATTTT GGAGTGTATC TCAAAAGCCT TCTTTAATAG TAGGCTCTGG TCCTTATAAG CTCGTCAAAT ATGTAAAAGG CCAATATGTA GAATATGAGG CAAATCCATA TTATTATAAA AGCCCGAAAA ACCTCCCTTA TATAAAACGT ATCAAAGCTT TTATTATACA AGATAAAAAT ATAGCCCTCA TTCAGTTTTT AGAAGGAGAT ATATCTTACA TTGGACTATC CCCAGAAGAT CTATCTTACT TTGCTTTAAA CAAACCAAAA ACCCCGGCTA TAGTATATGA CTTAGGGGAG ACTCCAACCA CCACGTTTAT CACATTCAAT CAAAATCCTC ATGCAGATAT ACCAAAATAC AAACTAAAAT GGTTTCAAAA CAGATTTTTT AGAGTGGCTA TATCTTACGC CATAGATAGA AAAGCTATAG CTTCTATGGT TTACAACAAC ATGGCAAGCC CGCTCTATGG ACCGATAACC CCCGCCAACA GACCCTACTA TAAAAAAGGT TTATTTAAAC GTTATCCTTT TAATCTATCA AAAGCCAAAA AACTTTTTAT AAAAGCTGGT TTTTATTACA AAAAAGGTAA GCTCTACGAT AAAGATGGTC ATAGAGTAGA GTTTAGTCTT ATCACAAACT CAGATGCCCA AGATAGAAAA TATATAGGAG CCATTGTAAA AGAAGACTTG GAAAAAATAG GTATAAAAGT GATATTTCAA CCAATAGACT TTAACTCTTT GGTATCTAAA CTTACAACAC CTCCTTATCA ATGGGAAAGC GTTTTGATAG GGCTAACTGG TTCCATTGAC CCAAACGATG GTAAAAACGT TTGGTATTCA AAGGGTTCTT TACATATATG GCATCCTATG GAGAAAAAAC CAGCCACACT TTGGGAAAAA GAGCTCGATA CGTTGTTTGA CGAAGGGTCA AAAACTATAA ATCCCAAAAA GCGTATAGAT ATATATAGAG CCGCTTTTTA TTTAATTCAG CATTACGAGC CTATGGTTTT CATAGTCACG CCTAAAAGCC TAATGACCTC AAAAGTTTAT ATGAAAAATT TTTATCCTAC AGTCTGGGGC TTTTATAAAA AAGATTATAT GTATATTAAA AGGTGA
|
Protein sequence | MYFAFKILKN PYINGILFTL ICFLPIYFSK SPKVSNISIK TVDLRKAHIS IGKTSNIAYI VLASDPKTLN PVLAQETSST DVIAPLFNGL TKIDLKTMSI KPELASSWKI LNNGKTYIIY LRKGLKWSDG KPLTAYDVEF TYNDIYYNPH IPNSIKDTLS VDGKPFKVKA LNKYTVEFDL PHRFAPFLQS ISAPILPKHI LENAVKQNTF NTFWSVSQKP SLIVGSGPYK LVKYVKGQYV EYEANPYYYK SPKNLPYIKR IKAFIIQDKN IALIQFLEGD ISYIGLSPED LSYFALNKPK TPAIVYDLGE TPTTTFITFN QNPHADIPKY KLKWFQNRFF RVAISYAIDR KAIASMVYNN MASPLYGPIT PANRPYYKKG LFKRYPFNLS KAKKLFIKAG FYYKKGKLYD KDGHRVEFSL ITNSDAQDRK YIGAIVKEDL EKIGIKVIFQ PIDFNSLVSK LTTPPYQWES VLIGLTGSID PNDGKNVWYS KGSLHIWHPM EKKPATLWEK ELDTLFDEGS KTINPKKRID IYRAAFYLIQ HYEPMVFIVT PKSLMTSKVY MKNFYPTVWG FYKKDYMYIK R
|
| |