Gene Haur_4165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4165 
Symbol 
ID5736026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5311411 
End bp5312478 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content50% 
IMG OID641281319 
Productglucose-1-phosphate thymidyltransferase 
Protein accessionYP_001546925 
Protein GI159900678 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1209] dTDP-glucose pyrophosphorylase 
TIGRFAM ID[TIGR01208] glucose-1-phosphate thymidylylransferase, long form 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.387622 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGGTT TAGTTCTGAG TGGTGGTAAA GGCACACGCC TGCGCCCAAT CACCTATACC 
AGCGCCAAGC AATTAGTGCC TGTCGCCAAC AAGCCAGTAC TTTTTCGGGT AATCGAAGCC
CTGCGCGATG CCAATATTGA TGAAATTGGG ATTGTAATCG GTGATACTGG GGCCGAAGTG
CGCAACGCCG TGGGCAATGG CTCACGCTGG GGCGTAAAGA TCGAATATAT TCCCCAAGAA
GCTCCGCTCG GCTTGGCTCA CGCGGTCAAA ATTAGTCGTC CGTTTATTGG TGACGATAAA
TTTGCACTCT TTTTGGGCGA TAACTGCATC GAAGGCGGAG TTAGTTCGTT GGTGTCGGGC
TTTGCTACAT CCGATTACAA TGCCCAAATT GTGCTCAAAC AAGTTGCCAA TCCACAGCAA
TATGGTGTCG CTGAGTTGCG TCACGATGGC TCGATCGAAC GCTTGACTGA AAAACCGCGC
CAACCGCGCT CAGACTTGGC GTTGGTCGGC ATTTATATGT TCGATCAGCA TATTTGGGAA
GCTGTTGAGG CGATCAAGCC TTCTTGGCGG GGCGAGTTGG AAATTACCGA TGCAATTCAA
TGGCTGATCG AGCACGATTA CCATGTTCAT GCCCATATTC ACCAAGGCTG GTGGATCGAT
ACTGGCAAAC GCGCTGATAT GCTCGATGCC AATCGTTTGG TGCTCGAAGA AATCACGCCC
CATGTATCGG GCTTCGTTGA TCGCGATTCG CAATTAGTTG GCAAAGTAAC GATCGAAAAA
GGCGCTCAAG TGATCAACAG CGTGATTCGT GGGCCAGCGA TCATCGGCGA AGATACGCGG
ATCGTTAACT CATACGTTGG ACCGTTTACC TCAATCTATC ATCATTGTAC GATTGAGGAA
AGCGAAATTG AGCACTCGAT TGTCTTGGAA AATAGCGAAA TTATTCGCCT GCCCAACCGC
ATCGAAGATA GCTTGATTGG CCGCAATGTC AAATTGCATA CCTCGCCGAT GAAACCCAAA
GCCTACCGCT TGATGCTCGG CGATAACTCT GACGTGGGGC TGTTGTAA
 
Protein sequence
MKGLVLSGGK GTRLRPITYT SAKQLVPVAN KPVLFRVIEA LRDANIDEIG IVIGDTGAEV 
RNAVGNGSRW GVKIEYIPQE APLGLAHAVK ISRPFIGDDK FALFLGDNCI EGGVSSLVSG
FATSDYNAQI VLKQVANPQQ YGVAELRHDG SIERLTEKPR QPRSDLALVG IYMFDQHIWE
AVEAIKPSWR GELEITDAIQ WLIEHDYHVH AHIHQGWWID TGKRADMLDA NRLVLEEITP
HVSGFVDRDS QLVGKVTIEK GAQVINSVIR GPAIIGEDTR IVNSYVGPFT SIYHHCTIEE
SEIEHSIVLE NSEIIRLPNR IEDSLIGRNV KLHTSPMKPK AYRLMLGDNS DVGLL