Gene Haur_0207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0207 
Symbol 
ID5732102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp241511 
End bp242764 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content50% 
IMG OID641277331 
Producthypothetical protein 
Protein accessionYP_001542987 
Protein GI159896740 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.835179 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCACA ACTTCTATGC AACCTTACCC ATTATCACCG ATTTTGTCCA AATTACCGAT 
GCCAATTGCT ACCATCGCGT CCCTGACGAT TGGGTTATTG TGGTGAGTGA TATTGAGCAA
TCGACCAAGG CGATTGGCGA GGGGCGCTAC AAAGATGTGA ATTTTATTGG CGCTAGCACG
ATTGTAGCCT TACTCAATTT GCAGCCCAAT CTCGATATCC CGTTTGTGTT CGGCGGCGAT
GGCGCAACCG TGTTATTGCC ACCATGGTTA GTCGAGCAAG CCAAACCTGC CTTGCAAGCA
GTCCAACATT TGAGCGAATC GATCTACAAT TTGCATTTGC GGGTGGGTAT TATGCCAGTC
AGTGAAGTTT ATGCCCATCG CTATCAGCTG GAAATTGCCA AATTCGCCGC CTCGGACAAT
TATGCCCAAG CGATGATCAA TGGTGATGGT TTGACCTTCG TCGAACAAAC GATCAAAGAT
CCCGTGGCCG GTGCGAAATA TTTGCTAGCC GCCCAGTCGA GCGATCAACC AGGCTTGCTC
GATGGCCTCG AATGTCGCTG GCAAGAAATT CCCAGCCGTT ACGGCGAAAC GGTTTCGCTC
TTAGTTCGGG CCGAGGCCAA CACCACTACT CAACGTAATG CAATCAATCG CCAAGTTATT
GAGCAGATTG AGGCTATCTA CGGCGCTGAC GATTCGCATC ACCCCGTCGA TGTACAACAA
CTAAGCCTAA CCTTACGGAT TCAAGATCTG TGGGGTGAAG CCCGCTTGCG TGGTGGTACC
AGCAAACTGC AACAATTGCG CTATCTCAAT AATATTTGGT GGCTGAATGT GCTGGGCAAA
TTGCTGTTGG CAACTGGAGC TAAAACCGAA TTAACCGATT GGGCCGAATA CCCACAGATT
TTGCAAGCCA GCACCGATTA TCGCAAATAC GATGCAATGC TGCGCATGGT GATTGCCGGA
ACTCCTGAGC AACGCCAGCA GCTTGAACAA TTTCTGAATG CTGAACGGGC GGCTGGGCGG
CTCAACTATG GCCTGCATGT TTCCGATAGT GCCTTGATGA CCTGTATCGT GTTTGAACGG
ATGGGCCGCC AAGTGCATTT TATCGATGGC AACAACGGCG GCTATGCCAA AGCTGCTGAT
CAACTCAAAC AGCAAAGCCA TTATCTCGAA CCACCTGTTA CAACCAAACC TGTGTCACAG
CCAAAAACAG GACTTTTGGG GGATACTTCA TCACCATCGT GGGGGACATG CTGA
 
Protein sequence
MAHNFYATLP IITDFVQITD ANCYHRVPDD WVIVVSDIEQ STKAIGEGRY KDVNFIGAST 
IVALLNLQPN LDIPFVFGGD GATVLLPPWL VEQAKPALQA VQHLSESIYN LHLRVGIMPV
SEVYAHRYQL EIAKFAASDN YAQAMINGDG LTFVEQTIKD PVAGAKYLLA AQSSDQPGLL
DGLECRWQEI PSRYGETVSL LVRAEANTTT QRNAINRQVI EQIEAIYGAD DSHHPVDVQQ
LSLTLRIQDL WGEARLRGGT SKLQQLRYLN NIWWLNVLGK LLLATGAKTE LTDWAEYPQI
LQASTDYRKY DAMLRMVIAG TPEQRQQLEQ FLNAERAAGR LNYGLHVSDS ALMTCIVFER
MGRQVHFIDG NNGGYAKAAD QLKQQSHYLE PPVTTKPVSQ PKTGLLGDTS SPSWGTC