Gene Haur_0185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0185 
Symbol 
ID5732094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp214663 
End bp215820 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content49% 
IMG OID641277309 
ProductPilT domain-containing protein 
Protein accessionYP_001542965 
Protein GI159896718 
COG category[R] General function prediction only 
COG ID[COG4956] Integral membrane protein (PIN domain superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000320699 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATTT CGGAAAAAAA GGCTGTGGCT GTAAACAGAA ATCGCTTACT TAGCATTGAT 
TTTTGGGTGC GCATTTTGGG GATGGTCGTG CTTGGCTATA TTGGCTGGTA TTTTGGCTCT
AGCTCGGCCA GCAATCCACG AACAAGCGAT GAAACCTTAG CCATGCAGCT ACTCACACTT
TCGGGGGCTG GCTTGGGTTT ATTGATATCC CATCGGATAA CCTTATACCC AATTCGCAAT
ATCAACCAAC GCTTGCGTCG TAGCACGGCT CAAGAGTTGA TTGCCTTGGC GCTGGGATCA
TTATTGGGTG TGATGCTCGC AGCATTATTA TCCATCCCCC TAAGCCAACT ACCTGGTCTT
TTAGGCAGCT ATTCGCCGGC ACTTGCCAGT TTCTTTATTA TCTACTTTTG TGTTGTAGCC
TTCGAGTATC ACAAGAAAAA TCTGGTTAAT TTTGGGGTTT CGCTGCAAAC ACCCAAAGTT
CGCGCAGTTA AAGAAGCTGT CCCAATGCGT CGAACCTGTT TGGTTGATAC AAGCGCCATT
ATTGATGGTC GAGTTTTGGC GGTTGTACGC AGTGGGTTTC TTGATGGGAT TTTGGTTGTG
CCGCGCTTTG TGCTCAACGA ATTGCAATTA TTGGCCGATT CGAGCGATGA TATGAAGCGG
ATGCGCGGTC GCCGTGGCTT GGATATGCTC GAAGAAATTC GCAAAGATGA TCAATTGCGG
CTCGAAATGC CCAACGATGA TATTGCCAAT GCGCGTGGCG TTGACCAGAA GTTGGTGACC
TTGGCGCTGC AAGATGGTCA TGCCTTGATT ACCAACGATA AGAATTTGAG CCAAGTTGCT
GAATTACAAG GCGTGCAGGT ACTCAATTTA AATGTGCTTT CCGATGCGGT ACGCCCACCC
GTTGGGGCTG GCGAAATGTT GGTTGTGAAA GTGCGTGAAG AAGGCCGCGA GCGCGAACAA
GGCATTGGCT ACCTCGAAGA TGGCACTATG GTAGTCGTCG AAGATGCCCG CGAACGGATC
GGTGATGAAG TTCGGGTGAT CGTCAGCCGG GTCTGGACGA ACGATCGTGG TCGCATGGTC
TTTGGGCGAA TTATGGGCAG TGCTGGAGCA TTTTACGGGG GCAAAAACGA TGCGGGCAAT
TATCCAGCGC GTAGCTAA
 
Protein sequence
MTISEKKAVA VNRNRLLSID FWVRILGMVV LGYIGWYFGS SSASNPRTSD ETLAMQLLTL 
SGAGLGLLIS HRITLYPIRN INQRLRRSTA QELIALALGS LLGVMLAALL SIPLSQLPGL
LGSYSPALAS FFIIYFCVVA FEYHKKNLVN FGVSLQTPKV RAVKEAVPMR RTCLVDTSAI
IDGRVLAVVR SGFLDGILVV PRFVLNELQL LADSSDDMKR MRGRRGLDML EEIRKDDQLR
LEMPNDDIAN ARGVDQKLVT LALQDGHALI TNDKNLSQVA ELQGVQVLNL NVLSDAVRPP
VGAGEMLVVK VREEGREREQ GIGYLEDGTM VVVEDARERI GDEVRVIVSR VWTNDRGRMV
FGRIMGSAGA FYGGKNDAGN YPARS