Gene HY04AAS1_0078 details


Gene Information

Locus tag: HY04AAS1_0078
Symbol:
ID: 6742861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Hydrogenobaculum sp. Y04AAS1
Kingdom: Bacteria
Replicon accession: NC_011126
Strand:
Start bp: 70234
End bp: 72216
Gene Length: 1983 bp
Protein Length: 660 aa
Translation table: 11
GC content: 36%
IMG OID: 642749862
Product: general secretion pathway protein D
Protein accession: YP_002120748
Protein GI: 195952458
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1450] Type II secretory pathway, component PulD
TIGRFAM ID: [TIGR02517] general secretion pathway protein D
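
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below shows one way to check them. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    return gene_len, codons - 1  # drop the stop codon


# Values taken from the record above.
gene_len, protein_len = cds_lengths(70234, 72216)
print(gene_len, protein_len)
```

Under those assumptions the computed values agree with the listed Gene Length (1983 bp) and Protein Length (660 aa).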


Plasmid Coverage information

Num covering plasmid clones: 59
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATTT TAAGAAAAAT AAATTGTTTG AATATATTTT CCGTTTTATG GATTTTTATA 
GTGTTTGGAA TGGCCCATGG AAAGGGAAAA ACCAAATACT TAAACAACAT GGTAGTACTA
AACTTCCAAA ACCAAAGCAT AGATGATATA GCAAAGTTTA TGTCAAAGCT TACTGGTAAA
ACTATAGTGA TAGGGATAAA GGGAAATCTT CCTAAAATAA CGGTTAGCTC CAAAAAACCT
GTAAGCGTAG AAGACGCTTG GCATCTTTTT TTAACGAGCT TAGCTTTAGA TGGGTACACT
GTTGTAAAAT ATAAAAACTT TTACAAGATA CTTCCTCTAA AAGAAGCATC TTCTTTTAGC
ACAAGTGTTA CAAAAAAAGC CTTTCCAAGC CCCTCTATAG AAACCTACAT TTACTTTGCC
CATACAAACT CCCAAATACT TCTAAATGCC GTAAGACCTT TTTTAAGCCA ATATGGTAAC
GCCACCGTTT ATATGCCCTC AAACGCCCTT ATCATATCTG ATATAGGAGT GTCTGTTGAT
AAAATCCAAA AACTTTTAAA GAGTATAGAT ATACCAAATT TAGCTTTTAG TCTAAAAATG
TATCAAACAA AAGATACAAA CGCCGTTGTA AAAGCCCTAA GCCCCTTGGC AAACCCTGTA
AGTCAAAAGT TTGGTATACC TATGGTGGTA TCTTCTGTTC AGAAAAAACA CTCCAAAAGC
GGTTTTGTGC TTGTATATGC ACCGAAGCTT ATGCAAGCCT CTATAAAAGA GATTATACAT
AAAATAAACG AAAGCGCCTC TCATTTTAGA AGACATTATT ATGTTATCCC ACTTCAAAAC
GCCTCTGTAG GAGAAATGGC AAAGACGTTG GCAAGCCTTT TTGGAAGCGC AAGCGCCATT
TCCTCTACCA CAAAAAGACC TACACCAAAC CTAAACACTA TGCAAAACCA ACCTCAAACA
ATTCAAAGCA ACGTACCTCC AAATCAAAAC ATAAACGTAA TATCTTCCAA CAAACCAATA
GGCTCTATAT ATCTTTCTGA TGGCACGAGG ATAGGTTTTG ACAGAGCCAC AAACAGCGTT
ATTTTATACG CCACAAAATC TCAATACGAA AATCTAAAAA ATCTCATAAA AAAACTAGAC
GAAAAACGCA TTCAAGTGCT GATAGCGGCT TCTGTAGTGG AGGCAAACCT CACCAAACAA
CTTACCACCG GTGTAAATTG GCAAGCTTTG GGCAAAAATG GCGGTATAGG ATTTAACCCG
GCATCTCTTC AAACCATATA CCAAGGGCTT TTATCTGGAA ACTTCGTAGT AGGTGTTACA
AGCTCAAGCA GTATAAGTGC AAACGTAGGT GGTAATACAA TCATATTTCC TGATTTAGCG
GTATTTTTAA GTTTGTTAGA GCAAGGAAAT GGTTTTAAAA TCATATCAAA CCCAAAGGTG
CTAACCCTTG ACAACGAAGA GGCTATTATA AAAGAAGCTC AAGTTTATCC TTATGTAACA
GGTACCCAAT ACAACATAAA CGGCTTCCCA ATACTCACTT ACGACTACAA AGATATAGGC
CTAGAGCTTG ATGTTATACC TACTGTTTCA AAAGACAACA TAAGGCTTGG CATAAACTTA
AATCTTCAAG ATATCACGGG CTTTACAAAT ACAAACGTAG CGGGTCAAAC GGTGCCTATA
CCTATTACCA CAGATAGAGT TTTAAATTCG GAAGTAGTCG TTAAAAGCGG TCAAACGGTG
ATATTAGGAG GACTCGTGAG CAACAACACT ATAAAAAACA TAAGCGGTAT ACCAATTCTT
CAAGATATCC CAGTTTTAGG AAATCTCTTC AAATATCAAA ACAGAGAAAA TAAAAAAAGC
ACTCTTTTTA TATTTATAAC ACCTTACATC ATAAAAAGCC CAGATCAACT TGCCAAAATT
ACGAAAGCAA ATGAAGTAAT AGCGCACAGA ATATACGAAA GTGTGAAAAA AGCAAAAGAA
TAA
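
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any downstream use. The following is a minimal sketch of how that might be done, together with two basic sanity checks and a GC fraction calculation. The helper names (`join_blocks`, `looks_like_cds`, `gc_fraction`) are hypothetical. The ATG start check is a simplification, since translation table 11 also permits alternative start codons.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(text):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def looks_like_cds(seq):
    """Check length is a multiple of 3, starts with ATG, ends with a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")  # simplification: alternative starts ignored
        and seq[-3:] in STOP_CODONS
    )


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Small illustrative input, not the full record.
example = join_blocks("ATGAAG ATT\nTAA")
print(example, looks_like_cds(example), gc_fraction(example))
```

Pasting the full gene sequence into `join_blocks` in place of the short example would allow the listed Gene Length and GC content to be compared against the computed values.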
 
Protein sequence
MKILRKINCL NIFSVLWIFI VFGMAHGKGK TKYLNNMVVL NFQNQSIDDI AKFMSKLTGK 
TIVIGIKGNL PKITVSSKKP VSVEDAWHLF LTSLALDGYT VVKYKNFYKI LPLKEASSFS
TSVTKKAFPS PSIETYIYFA HTNSQILLNA VRPFLSQYGN ATVYMPSNAL IISDIGVSVD
KIQKLLKSID IPNLAFSLKM YQTKDTNAVV KALSPLANPV SQKFGIPMVV SSVQKKHSKS
GFVLVYAPKL MQASIKEIIH KINESASHFR RHYYVIPLQN ASVGEMAKTL ASLFGSASAI
SSTTKRPTPN LNTMQNQPQT IQSNVPPNQN INVISSNKPI GSIYLSDGTR IGFDRATNSV
ILYATKSQYE NLKNLIKKLD EKRIQVLIAA SVVEANLTKQ LTTGVNWQAL GKNGGIGFNP
ASLQTIYQGL LSGNFVVGVT SSSSISANVG GNTIIFPDLA VFLSLLEQGN GFKIISNPKV
LTLDNEEAII KEAQVYPYVT GTQYNINGFP ILTYDYKDIG LELDVIPTVS KDNIRLGINL
NLQDITGFTN TNVAGQTVPI PITTDRVLNS EVVVKSGQTV ILGGLVSNNT IKNISGIPIL
QDIPVLGNLF KYQNRENKKS TLFIFITPYI IKSPDQLAKI TKANEVIAHR IYESVKKAKE