Gene Haur_1210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1210 
Symbol 
ID5733103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1394206 
End bp1395696 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content47% 
IMG OID641278350 
Producthypothetical protein 
Protein accessionYP_001543986 
Protein GI159897739 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0023593 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGTT ATGCCGACTT TGAATTAACG ATCAACCCAA AAGATGCTGA AAACAGTTTT 
GTGGTGCATG GGCGTACTGC CAAAGGCATG CAAGATAGCG ATAGCCTGAT TTTGCCCGTT
GACGATCCAC GTTATCAAGC CTTTCAAACC GCCTTGGACT ATAACACCCC GCTGACTGAA
GATCAGGTGA TTGATTTTGG GATCGTGCTC TACGAAACAT TGTTGAAAGG GAAAATTTGG
GCATTATTTA CGGCAGCCCG TGAGACTGCT CGCTCCCAAG GTCAATCATT ACGGATTAAA
CTTAATGTTG ATGCCAACAA CCCTGCTTTA GCGACAGTTG CCACGATTCC CTGGGAGTTT
GCCTGCGATA GCGCAGGAAT TCCACTGACA ACCGATCATT CAATCTGTCG ATTTTTGACT
TTTCCTGAAT CAGTACCAGT CTTGAGTTTG GGTCAAGAAA AATTACGGAT CGCCTTAGTG
GGAGCCTTGC CAGCTGAAAT GGCTACTACC CATCCAGTCG ATATCCAAGG TGAATTAGCG
GCGATTCATC GTTCGTTAGA GCCGTTGGTT ACCCAAAATC AAGTTGAAAT TTATGAAGAA
ACTCAGTTAA CTGCACCCAA ACTTCAGCGG CTTGTGCGCG AATGGCGACC ACATATCGTG
CATTATGTCG GTCATGGCGA TTTCCAAGGC ACAACTGGGG CATTGATTCT CGATGATGGC
AATGGCAAAA AACATCTTTC AACCGCTCGC ACATTAGCAA CCCTCTTGCG CAATACCTCA
GTGCGCTTGG TGGTGCTGAA TGCTTGTAAA ACTAGCACGG TTTCCTCAAC CGCCTTGCTG
CGTGGAATTG CCCCAGCCCT GATGGCGGCC AATATTCCAG CGGTTGTCGC CATGCAATCA
TCAATTTTAG ATACAGCAGG CAAGGCCTTT GCCGAAGAGT TTTATCGGGT ACTCGCAACT
GGTACGCCAA TTGATGCTTG TGTTGCTGAA GGGCGTAAAT CGATTATTGC CTATGGCTTT
GGCCAGCTTG ATTGGGGCTT GGCAACGCTC TATATGCGGG CTGATGATGG CGTGCTGTTC
AATATTCCCA CTCCATCAGT GCCAAGCAAT CAGGTGGTAA CTCCAACTAA CGAAACTACT
CCGCTCGCAA ATCTAACTGG TGGCAATAGT GTTAGCAATT TGCTGGGTAA CAATAACACC
ATTACTGGCG GTAATATCTC AATTGGCAAT GTGGTTGCTG GCAATCATAA TCAAACTACT
ATCAACCATG GTGTTGCTCA GCCAAATACT CCAAGTGCAG CCAATAATCA GCAACAAGCC
CTCGCAGCGG AACGCGAATT GCTGGCACTC AAACAGAAGA ATTTAAATAT TACCAAACTG
CAAATTGAAC AATACGGGAT TGGCGTGCCA GTGTATTTAC AAAACCAACA CGATGAGTTG
GTCAAGGATA TTGCGGCAAT TCAACAACGG ATTGCTGAAT TGAGCAAATG A
 
Protein sequence
MSGYADFELT INPKDAENSF VVHGRTAKGM QDSDSLILPV DDPRYQAFQT ALDYNTPLTE 
DQVIDFGIVL YETLLKGKIW ALFTAARETA RSQGQSLRIK LNVDANNPAL ATVATIPWEF
ACDSAGIPLT TDHSICRFLT FPESVPVLSL GQEKLRIALV GALPAEMATT HPVDIQGELA
AIHRSLEPLV TQNQVEIYEE TQLTAPKLQR LVREWRPHIV HYVGHGDFQG TTGALILDDG
NGKKHLSTAR TLATLLRNTS VRLVVLNACK TSTVSSTALL RGIAPALMAA NIPAVVAMQS
SILDTAGKAF AEEFYRVLAT GTPIDACVAE GRKSIIAYGF GQLDWGLATL YMRADDGVLF
NIPTPSVPSN QVVTPTNETT PLANLTGGNS VSNLLGNNNT ITGGNISIGN VVAGNHNQTT
INHGVAQPNT PSAANNQQQA LAAERELLAL KQKNLNITKL QIEQYGIGVP VYLQNQHDEL
VKDIAAIQQR IAELSK