Gene Haur_1598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1598 
Symbol 
ID5733485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1853285 
End bp1855312 
Gene Length2028 bp 
Protein Length675 aa 
Translation table11 
GC content51% 
IMG OID641278737 
Producthypothetical protein 
Protein accessionYP_001544369 
Protein GI159898122 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000695683 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAAC GTGTTGTTTG GTTCGCTGTG TTGCTGGTGG CCGTGCTTGG CGTGCCTTTT 
AGCGTTTGGG CTGCAACCCC GCGTCAAACT CCGACCAAAG CACCGCTGGC CGTTCAAGCT
ACTGTGCGAG TTGTGGAATC GGTTCCCGTG GTGGCTCCGG CGGTGGCAGT AACGGCAACT
GCGGTTACCT GCGATCCGCT GACCACCCGC TCGTTATATG GCTGGTTCAC CTCGAACACG
ACTGGCTCGG TTTACAATAG CTCATACACA TGTGCATTTG AGGTAGGGAT TGCCGCATAC
AGCATGTTTG ATGATCGAAT TGGCAACCAA CAATTGTTTG ACTACACCAT GTATACCGTG
GCTCCCCGTC AAACCATTAA CTTGAATGTA AAAATTCCTG ATTGCAAAGC CCAACTGGAT
ATCTTCCGTG GGCCAGTCTT GCACTCGTTG GTTGGTCAGC GCTATGGCGA ACGCTTGCTT
TCAACGCGCT TCCCCAACAC AACCCTTTGT GCTCCACCAG TTGAAGAAGT GTGTAGCCAA
GGCCAAATCA GCCGCTTGAG TGGCGTGAGC AACAATCAAT TGGTAACTGG AGTTTTGAAT
ATTCAAGCTG AAGTTACTGG CGCTTTGCCC CAAAAAGTTG AATTTGCTTT GACTGGTGCC
CAAACCACCA ACTATACCGA TGTCAACTCG CCCTACTATT TCATGGGCAA CAATGGTAGT
CAGCCCAATG GTTGGGATTC AAGCACCAAG CCCGAAGGCG ATTATCGCTT GAGCGCAACC
TATGTTGGTT TGTTTGGTGA ATCATTAGCG ATTCGCTGTG AGCCTGTGGC AGTCAATTTC
AGCATTCGCC GCAGCACGCC AACCACCGAG CCAACCGCAA CCAGCACACC GTTGCCAACG
GCAACCAACA CTCCAGTGCC AACCGCGACG AGTACGCCAA CGGCCACTGC AACCAACACG
CCAGTGCCAA CGGCAACCAA CACTCCAGTG CCAACCGCGA CGAGTACGCC AACGGCCACT
GCAACCAACA CGCCAGTGCC AACGGCAACC AACACTCCAG TGCCAACCGC GACGAGTACG
CCAACGGCTA CTGCAACTAG CACACCGTTG CCAACGGCAA CCAGCACACC GTTGCCAACG
GCAACCAGCA CACCGTTGCC AACCGCGACT AACACGCCAG TGCCAACAGC AACTAGTACT
CCAGTACCAA CCAGTACTCC TGTGCCTGGT AACCAATGTG TACCACAAGG GTTAGGCACT
GCTGGCGATT TCAATGTCTT TACCTTTGGC AACATCACCC AAAGCAACAC CGATATTGAA
GGTCGGGTCG CTGCTGGTGG CAATATCAAC TTCCAAAACT TTGGGGTTGG CGTGCGCTTG
ACCAATTCAA ATGGTACGCG CGACGACTTA GTTGCCGGCG GTTCGTTGAC CTACACCAAC
GGTTCGGTTT ACAACGGCAA TGTGGTTTAT GGCACGACTA AATCATTGAA TGGCGTTAGC
GTCTTGAATG GTACCGTTCG CCAAGGCCAA CCAATCAACT TTGCCAACGA GCAAACTTCA
TTGCGCAACC GTTCGCAAGC TTGGGGTGGC TTGAGTGCTA ATGGCACGAC GGTCTATGAA
TATGGTGCAG TCAAGTTGAG CGGAACCAAT ACAACCCTGA ATATCTTCAC GGTTGATGGT
GCTCAATTGA ATAATGCTAA TGGCTTGAAC ATCAACGTTC CAGCAAGTTC ATCAGTCTTG
ATTAATATCA CTGGCACGAA CAATCGAATG CAAAATTTCG AAACCTTCTT GACAAATGTT
GATCAAACCA AGATTGTCTA CAACTTCTAC CAAGCCACTA GCTTTAGCCT CTCAGGGATT
GGCATCAAAG GTACAATCTT GGCTCCATTT GCTGATGTAA GCTTTAGCAA TGGCCAAATC
AACGGGACAT TGATTGGTAA CTCATTGATT GGTGGTGGCG AATCACACCA TTATCCATTC
AATGGTTGTT TGCCAGCAAT TCCAGCTAAC AAGTCAGTTG AACGCTAA
 
Protein sequence
MKQRVVWFAV LLVAVLGVPF SVWAATPRQT PTKAPLAVQA TVRVVESVPV VAPAVAVTAT 
AVTCDPLTTR SLYGWFTSNT TGSVYNSSYT CAFEVGIAAY SMFDDRIGNQ QLFDYTMYTV
APRQTINLNV KIPDCKAQLD IFRGPVLHSL VGQRYGERLL STRFPNTTLC APPVEEVCSQ
GQISRLSGVS NNQLVTGVLN IQAEVTGALP QKVEFALTGA QTTNYTDVNS PYYFMGNNGS
QPNGWDSSTK PEGDYRLSAT YVGLFGESLA IRCEPVAVNF SIRRSTPTTE PTATSTPLPT
ATNTPVPTAT STPTATATNT PVPTATNTPV PTATSTPTAT ATNTPVPTAT NTPVPTATST
PTATATSTPL PTATSTPLPT ATSTPLPTAT NTPVPTATST PVPTSTPVPG NQCVPQGLGT
AGDFNVFTFG NITQSNTDIE GRVAAGGNIN FQNFGVGVRL TNSNGTRDDL VAGGSLTYTN
GSVYNGNVVY GTTKSLNGVS VLNGTVRQGQ PINFANEQTS LRNRSQAWGG LSANGTTVYE
YGAVKLSGTN TTLNIFTVDG AQLNNANGLN INVPASSSVL INITGTNNRM QNFETFLTNV
DQTKIVYNFY QATSFSLSGI GIKGTILAPF ADVSFSNGQI NGTLIGNSLI GGGESHHYPF
NGCLPAIPAN KSVER