Gene Haur_1023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1023 
Symbol 
ID5732927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1166937 
End bp1168370 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content50% 
IMG OID641278158 
Producthypothetical protein 
Protein accessionYP_001543799 
Protein GI159897552 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATCA CCACAAGTCA AGAGCTGATC GGCTGGCATC GGTATCTTGA GCAATTGGGT 
CCACGGCTGA CTGGCAGTGA GGCTCATCAG GCCTTTATCG AATTTTTAGC GACAGAATTA
ACCAGTTTGG GCTGTGAGGT TCAGCGTGAT CGCTATTATT TTACCCGTTG GCAAGCTCAA
AACTGGAGTT TAGCGCTCCT CGATAGCGCT GGCAACGAAA CGAGCATTCC CTGTAGCTTT
TATTACCCAT ATTCAGGCTC TACACCGCCT GAGGGCATCA TTGGCGAGTT GGTGGATTGT
GGCAAAAGCC CTGGCAATTT TCAGCAAGCT GCTGGCAAAA TTGCTTTGGT TGAAGTGGCA
GTTCCGGCGT TACCAACCAT GCTATTTCTG CACCCAACCA AATTCGCCCA AGCCCAGAAA
TTGCCAAAAC TGCTGCGAAA CCCAACCCTT GGCTCGTTTT TGACTGGCCC AAATTTAGCC
GCCGCCAAAC AAGCTGGAGT AAAAGGGGTA ATTTGCATTT GGTCGAAAAT CTCAGCGGCC
AATGCCGATG CTCAATATTT GCCCTTCACC ACCAGCTATC AAGCTTGCCC AGCGCTTTGG
GTCAACGCTG CGGTTGGTCA GCAACTCAAA CAGGCAGTTG GCCAAAAGAT TCGCTTCACC
CTCGAAGCAA CGCTCACTGA GCAATGCCCA ACTGATAGCT TGTATGTGGT TTTGCCAGGT
CAGCAATCCA ACGAAAGCCT TTTGATTAAT ACTCATACCG ATGGGCCGAA CGCACCTGAG
GAAAATGGCG CACTGGGCTT GCTGGCATTA GTGCGTTGGT TCAAACAGCA GCAGCATCAA
CGGAATTTGA TTTTTATTTT TGCCACAGGC CATTTTCAAT TGCCGCAACT TGGCAAGCAT
GGCCAAGCTA CCAGCACATG GCTCGCTGAG CACCCCGAAT TGTGGAATGG CCAGCAAATG
CGGGCAATTG CAGGTGTGAC CTTAGAGCAT TTGGGCTGTA CTGAATGGCT CGATAATCGG
GCATTAAGCG ATTATCAACC AAGCCAGCAA CCTGAACTTG AGCTAACCTA CACCACCAGC
CCAATGTTGG AGCAACTGTA TTACACCGCG TTGTTGCAGC GCACCAAACA GCGGGTACTC
ACGATTATGC CAATTAACGA GATTTATTTT GGCGAGGGCG AGCCATTCTA CAAAGCCAAC
ATTCCGACAA TTTCGCTGAT TCCAGCGCCT AATTATCTAT GTGCCACACC GAGCAACGCT
GTAATCGATA AACTTGATTT TGATTTGATG CAGCAACAAA TCGAAACCTT CGCCCGCGTG
ATCAAGATGA TCGATCAGAT CAGCACTAGC CATTTGGGCG TAGCTGAACC CCAGCCATTT
AGCCTTGTTG GCAGTGTATT CCGCCGCATG GTTGGAGCCA ATCAGCGGCA CTAA
 
Protein sequence
MKITTSQELI GWHRYLEQLG PRLTGSEAHQ AFIEFLATEL TSLGCEVQRD RYYFTRWQAQ 
NWSLALLDSA GNETSIPCSF YYPYSGSTPP EGIIGELVDC GKSPGNFQQA AGKIALVEVA
VPALPTMLFL HPTKFAQAQK LPKLLRNPTL GSFLTGPNLA AAKQAGVKGV ICIWSKISAA
NADAQYLPFT TSYQACPALW VNAAVGQQLK QAVGQKIRFT LEATLTEQCP TDSLYVVLPG
QQSNESLLIN THTDGPNAPE ENGALGLLAL VRWFKQQQHQ RNLIFIFATG HFQLPQLGKH
GQATSTWLAE HPELWNGQQM RAIAGVTLEH LGCTEWLDNR ALSDYQPSQQ PELELTYTTS
PMLEQLYYTA LLQRTKQRVL TIMPINEIYF GEGEPFYKAN IPTISLIPAP NYLCATPSNA
VIDKLDFDLM QQQIETFARV IKMIDQISTS HLGVAEPQPF SLVGSVFRRM VGANQRH