Gene Haur_1334 details


Gene Information

Locus tag: Haur_1334
Symbol:
ID: 5733226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1544738
End bp: 1546813
Gene Length: 2076 bp
Protein Length: 691 aa
Translation table: 11
GC content: 32%
IMG OID: 641278472
Product: hypothetical protein
Protein accession: YP_001544107
Protein GI: 159897860
COG category:
COG ID:
TIGRFAM ID:
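
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 1544738
end_bp = 1546813


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 2076 691
```

Both results agree with the Gene Length and Protein Length fields of the record.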


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0854082
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACTTG ACGATATTAT CACCGATGGC AAACATATTA CGCAACTACG CACAGTTTTA 
GTAAAATCAA TGGGTGAGAA TGGTGCAGAT AAGCTTATTA ATAATATTCT AGAATTAATT
CAAGAATTAC CTTCTGCCGA AGAAGGAAGT CAAAACCGAC ATGGATTGCT TCTAGGCTAT
ATTCAAAGTG GTAAAACATT TGCTTTCACT ACAGCTATAG CATTAGCGGC AGATAATGGA
TATCGACTCT TTATTATTCT TACTTCTAAT AACCTTATAC TCTATAATCA GACAATTGAT
GAGCGGTTGA AACAAGATTT ACAAAGTATA GAAGTGGAAG GGAAGGATAG TTGGGAACAA
AAGATACTAA TGATGACCCA AACCCTTAAA GATCCTAAGG GTGTTTTAGT ATTAGTTACA
ACAAAAAATA CTGCTATTCT TTACAAGTTA GAACAAACCC TTAGAACAAT TCAAGAAGAG
CTCAAGATGG GCCTTCCTAT AGCATTAATT ATTGATGATG AAGCTGATGA GGGTGGATTA
GATACTAATA CTCGAAGAAG AAGCGTTAAT CCTCTTATAG AGGCTGGGCC TACGTTCAGT
GCTATTGAAG AGATACGTCG TTTAGTTCCT AATCATGTCA GATTACAGGT TACAGCTACT
CCTCAAGCAC TCTTTCTTCA AGATTCTGGA CATGAATCAA GACCTGGTTT TACTGTTTTA
TTGGAACCAG GGGCTGATTA TGTTGGAAGC GAACAGTTTT TTGCGCTGAA ACAAGAAATT
GACATGATTT ATGAAAATGA TGATGAAAAT GAATTAGAGG AACGTAAATC AAAAATTATA
CGAAGAATCG ATCAGCATGA TATCCATATG ATGATTGAAC AAGAAGGTGA TAGTATTCCA
GATAGTCTGC GAGATGCATT ACTAACATTT TATATTGGAG CAACTATCAA GATAGTTGAT
GAACCTAGTA CTAGATTTTC TTTTCTTTGT CATATTAGTG CGAGAAAAGC AGATCATGAT
AAAATTAGTC AAATAATAAA TAAATATATA GGAGTACTTA GAAAATCATT AATAGATTAT
GTTGATAATA ATATCACAAG TGAAGATATA TATTATCTAG AAAAAATATA TACTGACATA
ATAAGTACAT ATGAGGATGG TATTTCATTA GGAACGATAA TTAATGAATT AAGAGAGTCT
ATTATAAAAA CAGATATAAG TGTAATTAAC AGTAGTACGA CCTATCAACC AACATATTCA
GGAAAATATA ATATTTTCAT TGGAGGAACT AAAATAGCGC GTGGGGTCAC CATAAAAAAT
TTAATTGTCA CATATTATGG GAGACAACCA AAAGTAACAA ACATGGATAC CATGCTTCAG
CATGCAAGAA TGTATGGGTA CAGAAAAAAT CATATGGATG TTACAAGACT ATTTATAACT
GAAGAAATTG AAAAAAGATT TACTGTTATT TATGAATCAG AAAAAGCATT ACGTGATTTA
ATAAAAAGAT ATCCTAATGA AAATTATCGC AGTATTATTA TAAATAACAC GGTAAGAGCA
ACAAGAAACA ATGTTCTAAA TAAGTTTAGT ATAGGATATT ACGTTTCTGG AAAGAATTAC
TTACAAAGAT ATCCATATTA CAATAAGTCA GATATAGATA AAACTACTAA AAATATTGAT
GCCATATTGG AAGACTATCC AACTACCGGT ATCAAGACCG AGGAAAAAGA GGTTGATATA
GAAATTCTGA TAGATATATT AAATAATATC CATTCGGTAC CTAGAACTTT TAGTCTTTGG
AATGACAAAA AAATTATATC TGCACTGGAA TTAATGAAGA CAGGAAACAT TACGAGAGGT
CTTTTAATTG TTAGCCGTAA TCGAAATATT GGTAGTAAAG ACAAATTTGG TGCTTTATTA
CCACCCGGCT ATAAAGCCAA AGCAAGCCGA GAATATCCAA CTTTATTTAT ATTCAAAGTT
ACTGGCGAAA ACTGGAATGG AAAACCTTTT TGGATACCTG CAATAACATT TCCAGATACA
AAAGACAAAT ATACTTTTGT CTTTAATCTT TCATAA
 
Protein sequence
MELDDIITDG KHITQLRTVL VKSMGENGAD KLINNILELI QELPSAEEGS QNRHGLLLGY 
IQSGKTFAFT TAIALAADNG YRLFIILTSN NLILYNQTID ERLKQDLQSI EVEGKDSWEQ
KILMMTQTLK DPKGVLVLVT TKNTAILYKL EQTLRTIQEE LKMGLPIALI IDDEADEGGL
DTNTRRRSVN PLIEAGPTFS AIEEIRRLVP NHVRLQVTAT PQALFLQDSG HESRPGFTVL
LEPGADYVGS EQFFALKQEI DMIYENDDEN ELEERKSKII RRIDQHDIHM MIEQEGDSIP
DSLRDALLTF YIGATIKIVD EPSTRFSFLC HISARKADHD KISQIINKYI GVLRKSLIDY
VDNNITSEDI YYLEKIYTDI ISTYEDGISL GTIINELRES IIKTDISVIN SSTTYQPTYS
GKYNIFIGGT KIARGVTIKN LIVTYYGRQP KVTNMDTMLQ HARMYGYRKN HMDVTRLFIT
EEIEKRFTVI YESEKALRDL IKRYPNENYR SIIINNTVRA TRNNVLNKFS IGYYVSGKNY
LQRYPYYNKS DIDKTTKNID AILEDYPTTG IKTEEKEVDI EILIDILNNI HSVPRTFSLW
NDKKIISALE LMKTGNITRG LLIVSRNRNI GSKDKFGALL PPGYKAKASR EYPTLFIFKV
TGENWNGKPF WIPAITFPDT KDKYTFVFNL S
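
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The sketch below strips the whitespace and translates the first line of the gene sequence as a check against the start of the protein sequence. It builds the codon table by hand rather than using a library; the amino acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in this record.
first_line = "ATGGAACTTG ACGATATTAT CACCGATGGC AAACATATTA CGCAACTACG CACAGTTTTA"
print(translate(first_line))  # prints: MELDDIITDGKHITQLRTVL
```

The output matches the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` should work the same way, since the function stops at the terminal TAA codon.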