Gene Haur_1008 details


Gene Information

Locus tag: Haur_1008
Symbol:
ID: 5732912
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1152262
End bp: 1153950
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 46%
IMG OID: 641278143
Product: CHAP domain-containing protein
Protein accession: YP_001543784
Protein GI: 159897537
COG category: [R] General function prediction only
COG ID: [COG3942] Surface antigen
TIGRFAM ID:
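
The coordinates and lengths in this record agree with each other, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1152262, 1153950)
residues = protein_length_aa(length)
print(length, residues)
```

With the values from this record, the computed lengths match the listed 1689 bp and 562 aa.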


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00120354
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCAACC ACCACTGGCG GATCTGCTTA ATTGTAGGAC TTTTAGTAAG TCTTGGTCTA 
TGGATGCAGC CTGCTAATGC GAACAAATCG TCAGATCCAA ATCGCTTGAT TGAGCCATCG
ATGGGTTTCG AGTTTGCTCA GCTCGATCAG TGGATACCAA GTATTTTTGC TGAGAATGGC
CCGCAAACAA ACTCACAAGT TGCCCAACGC AGCATTGGAT TTGTCCATCG CACAACCCCT
TCATTATTGG CAATTGTCAG TAGTTTCGCA AACCCGAAAG CGTTAAGTAG TAGCGGCTGG
ATTAAACGCT ATGATTATCG ACGAACTAAT GCTGCATATC AGATTGAAAC TATCGAATGG
CAAAAACGCA GTGTGCAGTT AATTAGTGAA CATCAGTTGA GTCGTCAAGC CCATGGAACT
CCCGATCATT TACGCTTGAT CATCGCGATC AATCAGCAGA TTATGGTTTT TGAATATATT
GGCTATAGCA TCAATCGGGC TGAATTCCTC GCTTGGCTTG AGCAGATCAC CCTAATTCCA
GCCCAAAAAT TCCAAGCATC ACCGTTGAGC AACGATGTTA AAGAGGCATT TGCTCAAGCT
AACCAGCCTT TAGCTGTATC AATCCAAAAT TGCTGTGGGG TTAGTGACCC TGAATTCAAT
CCTTTTCCTT GTAATAGCAG CGTTGGCAAT GGCAATTGTA CGTGGTGGGT CCGCTACCGC
CGAACAGGCA ACAATATTGC CAATTTATCT AATTGTACTG GCAACGCGGA TACATGGGAT
GAATGTGCTG CCAGCTCTTA TCCACAATTA CTCAGTGATA CGCCCAGCGT CAAAAGTGCC
GTGGTTTGGA CAAACATAAA TCATGTAGCA TTTTTGGAAC AAGTTAATAG CCCAACCAGC
ATTACGATGT CGCAAATGAA TTGGTATAGT CCATGTCCGC AATCAACTAT CACCCAAGGG
ATTACAAATA AGAAATTTAT TCGCCACCCT GATGCCATTC AACCCGAACC CGCAAAGCGT
TGGCATCTGA GCTACAATTT ATCAAGCGGC AATGCCGAGT TATCCTTTAA TTATGGCTTG
AAATCGGATA AAGCGGTAAT TGGCGATTGG AATGGTGATG GTATTGATAC GCCAGGGGTT
GTACGTGGCA ATACTTGGTA TCTTTCAAAC ACCTATGGCG AACCACATAC CATCAGTTTT
GAATTCGGTG ATCCCAATGA TATTCCGGTG GTTGGCGATT GGAACGGCGA TGGCAAAGAT
ACCCCTGGCC TTGTTCGCGG AACGACTTGG TATATCTCAA ACAACCTGAA TGGCGGCTGG
GCCGAACGAT CCTTCGGCTT TGGTGAGGCT GGCGACAAAC CCGTGGTTGG CGATTGGAAC
GGCGATGGCA AAGATAGCCC TGGGGTTGTG CGCGGCATAA CATGGTATCT TTCCAATAAT
CTTAATGGCG GCTGGGCTGA TATTTCGCTT GGCTTTGGTG AGCTAGGCGA TACATTCATC
GTGGGCGATT GGGATGGCGA TGGTGATGAT ACCCCTGGGG TTGTGCGTGG CAATATGTGG
TATCTCTCCA ACAACCTCAA TGGTGGTTGG GCCAATCTCT CCTTCATGTA TGGCGATCCC
GGTAACTATC CAATTGTTGG TAATTGGGGT GATAGCGATC GGAATAGTGA GATTGGCGTA
ATTCCCTAA
 
Protein sequence
MRNHHWRICL IVGLLVSLGL WMQPANANKS SDPNRLIEPS MGFEFAQLDQ WIPSIFAENG 
PQTNSQVAQR SIGFVHRTTP SLLAIVSSFA NPKALSSSGW IKRYDYRRTN AAYQIETIEW
QKRSVQLISE HQLSRQAHGT PDHLRLIIAI NQQIMVFEYI GYSINRAEFL AWLEQITLIP
AQKFQASPLS NDVKEAFAQA NQPLAVSIQN CCGVSDPEFN PFPCNSSVGN GNCTWWVRYR
RTGNNIANLS NCTGNADTWD ECAASSYPQL LSDTPSVKSA VVWTNINHVA FLEQVNSPTS
ITMSQMNWYS PCPQSTITQG ITNKKFIRHP DAIQPEPAKR WHLSYNLSSG NAELSFNYGL
KSDKAVIGDW NGDGIDTPGV VRGNTWYLSN TYGEPHTISF EFGDPNDIPV VGDWNGDGKD
TPGLVRGTTW YISNNLNGGW AERSFGFGEA GDKPVVGDWN GDGKDSPGVV RGITWYLSNN
LNGGWADISL GFGELGDTFI VGDWDGDGDD TPGVVRGNMW YLSNNLNGGW ANLSFMYGDP
GNYPIVGNWG DSDRNSEIGV IP
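
Both sequences are printed in blocks of ten characters, so the spaces and line breaks need to be removed before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used as sample input only.
first_line = "ATGCGCAACC ACCACTGGCG GATCTGCTTA ATTGTAGGAC TTTTAGTAAG TCTTGGTCTA"
dna = clean_sequence(first_line)
print(len(dna), dna[:3], round(gc_fraction(dna), 2))
```

Passing the full gene sequence through the same two functions gives a value to compare against the GC content listed in the Gene Information section.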