Gene Haur_1717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1717 
Symbol 
ID5733604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1998599 
End bp1999903 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content51% 
IMG OID641278859 
Producthypothetical protein 
Protein accessionYP_001544488 
Protein GI159898241 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGATG CCAATCAACC GCTGCACGAC GAACCCTTAC CCGAATTTGA TTTGCCTGAC 
CCCAAAGATG CGATGCCGCA TGAAGATACG CTCAGCGAGG AAGGCACGCG CGGGGTCGTG
CCTGTTGACC AGTTTTTGTA TGTCGGTCAG GGTCTGAAAG CCGATGAATT TACCCATTAC
GTCGATACCT ATAACTTTGG GGCTGTGCCA CCTAACTTTG TGGTGTTGCA CCATACCGCT
GTACCAAGTA CTGCCGCTGC TCCTTACCCC TCAGGCTGGC GCTGGGATAA CCAAGAAACT
GGTTTGAGCG AAGGCCAAAT TTACCGCAAA CGCCTCAAAC AGCTCGAAAC CCTGCGCGAA
TACTATCGCA CCAGCGCTGG CTGGGATCGT GGCCCACACT TATTTATCGA TGAAACCTGG
ATTTGGTTGT TCACGCCGAT GTACGATCAA GGGATTCACG CCGCTCAAGG TAATGGCTAT
CGCGATAGTA AAGGTACATT GCAATATTCA ATCGGCATCG AAGTATGTGG CTATTTTGAA
AAAACCCAGT GGTCGGCTCC CGTTGCAGCC CTGGTTGGCC ATGCGGTGGC AGTGCTCAAA
CGCAAATTAA ACAGCTTTGA GATTCGCTAT CAGAAATTTG GTGGTGGCAT TTCATCGCAC
CGCGATTACA ATAAACCTTC ATGTCCAGGC GCAGCGATCA CCGAATCCTA TTACATCAGT
ACAATTCAAA ATGCCTTTGA TCGTTTGAGT AATGTTCAAA GCACGCCAAT TGCTGATAAT
CCAATTACTA CCAATACCCC ATTGTTGAGC GCAACGCCCT CAGGCAGCCG CGAAAAGGCA
ATCGCCTTTA TTCGTAAGAG CCTGCCCGAT AATTCCGAAT ATAAGAACGA TATTGAAACG
ATTATGGGGT ATTACTGGGT CTATGCCCCC AGCGTTGGGC TTGATCCCTT CCTCGCTGCA
TCGCAATGTA TTTTTGAAAC CGCTGGCTTG ACTTCAGGCT GGGCTGCGCG GCCAAAACGT
AATCCGGCTG GTTTGGGCGT GCGCCAAGAG GGCGGCCTTT CGTTCAGCAC CTGGGATGGC
GCAGTCCAAG CGCATATTGG GCAATTATTG GCCTTGGCCT TGCGTGATGA TGAGGCTAAT
CAAGCCCAAA AAACCATGAT GGCTGCCAAT CCACGCCATG GCAACATTCC AGCCAATCTG
CGTGGGGTTG CCAAAACGTT GGCTGGCTTG AGCAATAATT GGACTGACGA TGCCGATTAT
GCCAACAAGT TTGCGACTCG CGCCGAGGCA ATTCGCAAGG GCTAG
 
Protein sequence
MTDANQPLHD EPLPEFDLPD PKDAMPHEDT LSEEGTRGVV PVDQFLYVGQ GLKADEFTHY 
VDTYNFGAVP PNFVVLHHTA VPSTAAAPYP SGWRWDNQET GLSEGQIYRK RLKQLETLRE
YYRTSAGWDR GPHLFIDETW IWLFTPMYDQ GIHAAQGNGY RDSKGTLQYS IGIEVCGYFE
KTQWSAPVAA LVGHAVAVLK RKLNSFEIRY QKFGGGISSH RDYNKPSCPG AAITESYYIS
TIQNAFDRLS NVQSTPIADN PITTNTPLLS ATPSGSREKA IAFIRKSLPD NSEYKNDIET
IMGYYWVYAP SVGLDPFLAA SQCIFETAGL TSGWAARPKR NPAGLGVRQE GGLSFSTWDG
AVQAHIGQLL ALALRDDEAN QAQKTMMAAN PRHGNIPANL RGVAKTLAGL SNNWTDDADY
ANKFATRAEA IRKG