Gene HY04AAS1_1182 details


Gene Information

Locus tag: HY04AAS1_1182
Symbol:
ID: 6743999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Hydrogenobaculum sp. Y04AAS1
Kingdom: Bacteria
Replicon accession: NC_011126
Strand:
Start bp: 1092060
End bp: 1093805
Gene Length: 1746 bp
Protein Length: 581 aa
Translation table: 11
GC content: 34%
IMG OID: 642750991
Product: extracellular solute-binding protein family 5
Protein accession: YP_002121845
Protein GI: 195953555
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
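
The coordinates and lengths listed above are mutually consistent: the inclusive span from start to end is 1746 bp, which is 582 codons, or 581 amino acids plus one stop codon. A minimal sketch of that arithmetic follows; the helper name `expected_protein_length` is hypothetical, and it assumes an unspliced CDS that ends in a single stop codon.

```python
def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS ending in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record above.
start_bp, end_bp = 1092060, 1093805
gene_length = end_bp - start_bp + 1  # coordinates are inclusive
protein_length = expected_protein_length(gene_length)
```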


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATTTTG CTTTTAAAAT ACTCAAAAAT CCCTACATTA ACGGTATTTT ATTTACGCTC 
ATATGTTTTT TGCCTATATA TTTTTCCAAA AGCCCAAAAG TCTCCAACAT ATCTATAAAA
ACTGTAGACC TAAGAAAAGC TCATATATCC ATTGGCAAAA CATCTAACAT CGCTTATATC
GTTTTAGCTT CAGATCCCAA AACCCTAAAC CCAGTACTAG CTCAAGAGAC ATCTTCCACC
GATGTAATCG CCCCTCTTTT TAACGGCCTT ACAAAGATTG ACTTAAAAAC TATGAGCATA
AAACCAGAAC TCGCCTCTTC TTGGAAAATA CTAAATAACG GAAAAACCTA CATAATATAT
CTTAGAAAAG GTTTAAAATG GTCCGATGGA AAACCCCTAA CCGCTTACGA TGTAGAGTTT
ACCTACAACG ATATATATTA TAACCCTCAT ATTCCAAATT CTATAAAAGA TACGTTATCA
GTGGATGGTA AACCTTTCAA GGTAAAAGCG TTAAACAAAT ACACTGTAGA GTTTGATTTG
CCTCACCGGT TTGCCCCCTT TCTACAATCT ATAAGCGCTC CCATACTACC AAAGCATATT
TTAGAAAACG CTGTAAAACA AAATACCTTT AACACATTTT GGAGTGTATC TCAAAAGCCT
TCTTTAATAG TAGGCTCTGG TCCTTATAAG CTCGTCAAAT ATGTAAAAGG CCAATATGTA
GAATATGAGG CAAATCCATA TTATTATAAA AGCCCGAAAA ACCTCCCTTA TATAAAACGT
ATCAAAGCTT TTATTATACA AGATAAAAAT ATAGCCCTCA TTCAGTTTTT AGAAGGAGAT
ATATCTTACA TTGGACTATC CCCAGAAGAT CTATCTTACT TTGCTTTAAA CAAACCAAAA
ACCCCGGCTA TAGTATATGA CTTAGGGGAG ACTCCAACCA CCACGTTTAT CACATTCAAT
CAAAATCCTC ATGCAGATAT ACCAAAATAC AAACTAAAAT GGTTTCAAAA CAGATTTTTT
AGAGTGGCTA TATCTTACGC CATAGATAGA AAAGCTATAG CTTCTATGGT TTACAACAAC
ATGGCAAGCC CGCTCTATGG ACCGATAACC CCCGCCAACA GACCCTACTA TAAAAAAGGT
TTATTTAAAC GTTATCCTTT TAATCTATCA AAAGCCAAAA AACTTTTTAT AAAAGCTGGT
TTTTATTACA AAAAAGGTAA GCTCTACGAT AAAGATGGTC ATAGAGTAGA GTTTAGTCTT
ATCACAAACT CAGATGCCCA AGATAGAAAA TATATAGGAG CCATTGTAAA AGAAGACTTG
GAAAAAATAG GTATAAAAGT GATATTTCAA CCAATAGACT TTAACTCTTT GGTATCTAAA
CTTACAACAC CTCCTTATCA ATGGGAAAGC GTTTTGATAG GGCTAACTGG TTCCATTGAC
CCAAACGATG GTAAAAACGT TTGGTATTCA AAGGGTTCTT TACATATATG GCATCCTATG
GAGAAAAAAC CAGCCACACT TTGGGAAAAA GAGCTCGATA CGTTGTTTGA CGAAGGGTCA
AAAACTATAA ATCCCAAAAA GCGTATAGAT ATATATAGAG CCGCTTTTTA TTTAATTCAG
CATTACGAGC CTATGGTTTT CATAGTCACG CCTAAAAGCC TAATGACCTC AAAAGTTTAT
ATGAAAAATT TTTATCCTAC AGTCTGGGGC TTTTATAAAA AAGATTATAT GTATATTAAA
AGGTGA
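
The gene sequence above is printed in space-separated blocks of ten bases, so the spacing has to be stripped before computing anything from it. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field; the function names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTATTTTG CTTTTAAAAT ACTCAAAAAT CCCTACATTA ACGGTATTTT ATTTACGCTC"
first_line_gc = gc_content(first_line)
```

Passing the full gene sequence to `gc_content` and multiplying by 100 gives a percentage that can be compared with the value reported in the record.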
 
Protein sequence
MYFAFKILKN PYINGILFTL ICFLPIYFSK SPKVSNISIK TVDLRKAHIS IGKTSNIAYI 
VLASDPKTLN PVLAQETSST DVIAPLFNGL TKIDLKTMSI KPELASSWKI LNNGKTYIIY
LRKGLKWSDG KPLTAYDVEF TYNDIYYNPH IPNSIKDTLS VDGKPFKVKA LNKYTVEFDL
PHRFAPFLQS ISAPILPKHI LENAVKQNTF NTFWSVSQKP SLIVGSGPYK LVKYVKGQYV
EYEANPYYYK SPKNLPYIKR IKAFIIQDKN IALIQFLEGD ISYIGLSPED LSYFALNKPK
TPAIVYDLGE TPTTTFITFN QNPHADIPKY KLKWFQNRFF RVAISYAIDR KAIASMVYNN
MASPLYGPIT PANRPYYKKG LFKRYPFNLS KAKKLFIKAG FYYKKGKLYD KDGHRVEFSL
ITNSDAQDRK YIGAIVKEDL EKIGIKVIFQ PIDFNSLVSK LTTPPYQWES VLIGLTGSID
PNDGKNVWYS KGSLHIWHPM EKKPATLWEK ELDTLFDEGS KTINPKKRID IYRAAFYLIQ
HYEPMVFIVT PKSLMTSKVY MKNFYPTVWG FYKKDYMYIK R
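
The protein sequence above can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the pipeline that produced this record: the name `translate` is hypothetical, the codon table is built from the standard amino-acid assignments (which translation table 11 shares for elongation codons), and alternative start codons are not handled, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codons in TCAG order paired with their standard amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTATTTTG CTTTTAAAAT ACTCAAAAAT CCCTACATTA ACGGTATTTT ATTTACGCTC"
first_peptide = translate(first_line)
```

Translating the first line of the gene sequence yields the first twenty residues of the protein sequence, and the final six bases (AGGTGA) give the terminal arginine followed by a stop codon.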