Gene Haur_1066 details

Gene Information

Locus tag: Haur_1066
Symbol:
ID: 5732970
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1218561
End bp: 1219562
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 52%
IMG OID: 641278201
Product: ribokinase-like domain-containing protein
Protein accession: YP_001543842
Protein GI: 159897595
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2870] ADP-heptose synthase, bifunctional sugar kinase/adenylyltransferase
TIGRFAM ID: [TIGR02198] rfaE bifunctional protein, domain I
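
The coordinate and length fields in this block can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1218561
END_BP = 1219562


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1002 333
```

Under these assumptions the result agrees with the Gene Length (1002 bp) and Protein Length (333 aa) fields.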


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGCATGA TCACGGTTGA ACATGTGGCC CAACTCGCCA ATCGCCGCGT ACTGGTGGTT 
GGCGATGTTG TGCTCGATGA ATATTTGTAT GGCAAGCCCG AACGGCTCTC GCGCGAGGCA
GCAATTCCGG TTTTAGAATT TGAGCAGCGG CGGATTATCC CTGGTGGTGC GGCCAATCCC
GCCGCCAACA TTACAGCGCT CGGCAGCAAT GCTGGCATCG TGGCGTTAAT CGGCGCTGAT
CAGGCTGGTC AAGAATTAGC CAATGCCTTA CATAAACGCA AGGTCAGCAC CGCTGGCTTG
CTGCGCGATG AGCAACGCCC AACCACCACC AAAACCCGTA TTTTGGCCTC GGTGCAATTG
ACCGTGGCCC AACAAGTCGC GCGGCTCGAT AAAATTGATC GGCGGCCGGT TGACCCAGCT
TTTGAAGATC AGGCGATCGA ATTATTAGGC CAATTAATTC CCCAAGTTGA TGCTGTGCTG
TGCTCAGATT ATCGCGTGGG CTGGCTTAGT GAGCGGCTTA TTCAACACAT TCAACAATTA
TGTCAACAAT ATCAAACCTT GCTGACAGTT GATAGCCAAG GCCGCTTTGA ACCCTACGCT
GGAGCCGATT TTCTCAAGTG CAATTTGGGC GAGGCTGAGG CTTGGCTTGG CCAGCGTTTA
ACCAATGATC AGCAGGTTGA ACAAGGCTTA GAACGCTTGC GCGATCAGCT CAAATTGGCG
GCAGTGGTGA TTACCAGAGG CGGGGCAGGC TTTTCGTTGC TTGATCCAGC AGGCATTCAT
CATATTCCAG CAGTGCCAAT TGGCGAAGTT TTTGATGCAA CCGGGGCGGG CGATACCTTT
ATTGCCACCG CAACGCTTAG TTTGTGTGCT GGCCATAGCC CATTAATTGC CGCCCAACTT
GCTAATACTG CGGCGGCCTT GGTAGTACGA CGAATTGGCG TGGCCACAGT TAGCCCCAAC
GAACTGCAAA ACGCATTAAT TCAATTTGGC CAGATTGTAT GA
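
The record reports a GC content of 52%. A minimal sketch of how such a figure could be recomputed from the gene sequence is shown here. It assumes the sequence is pasted in as a string and that the spaces and line breaks between the 10-base groups should be ignored. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are skipped, so the 10-base
    grouping used on this page can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# Usage: the first row of the gene sequence above, as an example input.
first_row = "ATGTGCATGA TCACGGTTGA ACATGTGGCC CAACTCGCCA ATCGCCGCGT ACTGGTGGTT"
print(round(gc_percent(first_row), 1))
```

Running it over the full gene sequence, rather than a single row, is what would be compared with the reported value.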
 
Protein sequence
MCMITVEHVA QLANRRVLVV GDVVLDEYLY GKPERLSREA AIPVLEFEQR RIIPGGAANP 
AANITALGSN AGIVALIGAD QAGQELANAL HKRKVSTAGL LRDEQRPTTT KTRILASVQL
TVAQQVARLD KIDRRPVDPA FEDQAIELLG QLIPQVDAVL CSDYRVGWLS ERLIQHIQQL
CQQYQTLLTV DSQGRFEPYA GADFLKCNLG EAEAWLGQRL TNDQQVEQGL ERLRDQLKLA
AVVITRGGAG FSLLDPAGIH HIPAVPIGEV FDATGAGDTF IATATLSLCA GHSPLIAAQL
ANTAAALVVR RIGVATVSPN ELQNALIQFG QIV
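
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this under stated assumptions: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in which codons may act as start codons), and it stops at the first stop codon. The function name `translate` is hypothetical, and no alternative start codon handling is attempted.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first so the 10-base groups on this page can be
    pasted in directly. Codons with unexpected characters become 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# The first 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGTGCATGA TCACGGTTGA ACATGTGGCC"))  # prints: MCMITVEHVA
```

The same function applied to the last 12 bases of the gene sequence (`CAGATTGTAT GA`) yields `QIV`, the final residues listed above, with the terminal TGA read as a stop.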