Gene Haur_1811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1811 
Symbol 
ID5733669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2103908 
End bp2105893 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content53% 
IMG OID641278954 
Producthypothetical protein 
Protein accessionYP_001544582 
Protein GI159898335 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.032061 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGGAA AACGTTCATC GCTGGTGATC GTTGCAGCCT TGGTGGTCGC ATTATTGAGT 
AGCTTTGCCG CCGCCACTCT CGTCACTCCT ACCGCCGCTT CGTCCGATGG TTTTTGGCAA
GATGTGAAGG AAAGTCGGAT TGCCCAAAAA GGTGCTCGCC AAATTGTGCC AACGATCTAT
CGCACCGTTG CACTTGATGT CGCTGGATTG AGCCAATATT TGGCCAAAGC ACCACTAGAG
CAAGATCAGT CGGTGGCAAA CTCGTTGTAT ACCCTGCAAT TGCCCATGCC CAATGGCAAA
TTCGAGCAAT TCCGCGTTGT CGAATCGCCG ATTATGGAGC CAGAACTAGC GGCAAAATTC
CCCGAAATTC GCACCTACTT AATCCTTGGG GTGGATACTC CCAATCTGAG CGGGCGCTTG
GATCTGACCC CCGCCGGCTT TCATGGCTTA ATTCTAGGCG ATCAAGGCCG GATTTTCATT
GATCCCTACA GCGCTGGCGA TACTCAACAT TACATCGTTT ACGATAGCAA AAATTTTGTG
CCCAGCAGCC AAAAATTGGC TGATCGCCAA GTCGAAGATT ATGTGATCGA CCTACCATTG
ACCGATGATG GCTTGGCAGC ACCCAAAGCG GTTGGCGATA AATTGCGCAC CTACCGTTTG
GCCATGGCCG CCACTGGCGA GTACACTGCC TATCATGGTG GAACCGTTGC CAAAGGCTTG
GCGGCAATCA CCACCAGCGT CAATCGGGTT AACGCAGTGT ACGAACGCGA AGTTGCGGTA
CGCATGGTAC TTGTTGCCAA CAACAGCAAT ATCATTTACA CCAACGCCAG CACCGACCCC
TACACCAACA ACAATGGGGT TACGATGCTC AGCCAAAATC AGACCAATGT TCGCAATGTA
ATTGGCAATG CCAATTATGA TATTGGTCAC GTGTTCAGCA CTGGCGGCGG CGGGGTAGCC
TCGTTAGGCT CAGTCTGCTC GACCAACTAC AAAGCTCAAG GCGTAACTGG CTCATCGGCT
CCAGTTGGCG ATCCATTTGA TATTGATTAT GTGGCCCACG AAATTGGTCA TCAATTTGGC
GGCAACCATA CCTTCAATGG CACAACTGGT AGTTGTGGTG GTGGCAATCG CGCTAGTAGC
GCCGCCTACG AGCCAGGTAG CGGCTCGACA ATTATGGCCT ACGCAGGGAT CTGTGGCGCT
GAAAACTTAC AATCCAACAG CGATCCGTAT TTCCATTCCA AGAGCTTGAA CGAAATTACG
ACCTTCATCA CAACTGGCGG CGGCTCAAGC TGTGGCACAG CAACCAACAC TGGCAACACC
GCTCCGGTGG CCAATGCTGG GGCCGATTAC ACGATTCCAC GCAGCACACC ATTCGAATTA
ACCGGCACTG GCAGCGATGC CAACGGCGAT AGCATGACCT ACAACTGGGA GCAATATAAC
TTGGGCACGG CTGCGCCACC CAACACCGAC AACGGCTCAC GCCCAATTTT CCGTAGCTTC
AACCCAAGCA CCAACCCTAA GCGCTCGTTC CCTAAATTGA GCGATATTTT GAACAATACC
GCGACAATTG GTGAATCATT GCCAACCACC AGCCGCACCA TGGTTTTCCG GCTGATTGTG
CGCGATAACC GGGCAGGCGG CGGCAGCTAT GCGATTGATA GCGCCAATGT GACGGTCAGC
AGTGCGGCTG GGCCATTCGC CGTAACCTCG CCGAATACCG CCGTCACTTG GACTCGCAAC
AGCAGCCGTA CCATCACTTG GAATGTTGCC AGCACCACCG CTGCACCAAT TAGCTGCGCC
AATGTTGAAA TTTTGTTCTC AAGTAATGGT GGCAGCAGCT TCAGCAGCTT GGTTAGCTCA
ACCCCCAACG ATGGCAGCCA AGCCGTTACC ATCCCCAACA CCGCAACCAC CCAAGCACGA
ATCAAAGTAC GCTGTGCCAA CAACGTCTTC TTTGATCTTT CAAATGTGAA CTTTACCGTA
AACTAG
 
Protein sequence
MPGKRSSLVI VAALVVALLS SFAAATLVTP TAASSDGFWQ DVKESRIAQK GARQIVPTIY 
RTVALDVAGL SQYLAKAPLE QDQSVANSLY TLQLPMPNGK FEQFRVVESP IMEPELAAKF
PEIRTYLILG VDTPNLSGRL DLTPAGFHGL ILGDQGRIFI DPYSAGDTQH YIVYDSKNFV
PSSQKLADRQ VEDYVIDLPL TDDGLAAPKA VGDKLRTYRL AMAATGEYTA YHGGTVAKGL
AAITTSVNRV NAVYEREVAV RMVLVANNSN IIYTNASTDP YTNNNGVTML SQNQTNVRNV
IGNANYDIGH VFSTGGGGVA SLGSVCSTNY KAQGVTGSSA PVGDPFDIDY VAHEIGHQFG
GNHTFNGTTG SCGGGNRASS AAYEPGSGST IMAYAGICGA ENLQSNSDPY FHSKSLNEIT
TFITTGGGSS CGTATNTGNT APVANAGADY TIPRSTPFEL TGTGSDANGD SMTYNWEQYN
LGTAAPPNTD NGSRPIFRSF NPSTNPKRSF PKLSDILNNT ATIGESLPTT SRTMVFRLIV
RDNRAGGGSY AIDSANVTVS SAAGPFAVTS PNTAVTWTRN SSRTITWNVA STTAAPISCA
NVEILFSSNG GSSFSSLVSS TPNDGSQAVT IPNTATTQAR IKVRCANNVF FDLSNVNFTV
N