Gene Haur_1221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1221 
Symbol 
ID5733114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1412296 
End bp1413285 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content47% 
IMG OID641278361 
Productnucleotidyl transferase 
Protein accessionYP_001543997 
Protein GI159897750 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1209] dTDP-glucose pyrophosphorylase 
TIGRFAM ID[TIGR01208] glucose-1-phosphate thymidylylransferase, long form 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGTAA TTATTCCTAC CGCCGGTCTT GGAACCCGCC TACGTCCACA TACCCATACC 
CGTCCAAAGC CGTTGGTTCC GGTTGCAGGC AAAGCCGTAA TTGGCCACCT GCTTGATAAG
CTCAAGGTGC TACCACTCGA CGACGTGGTG TTTATTACCG GGTATTTGGG TACACAAATT
GAGGAATATG TTCGTAAGAA CTATAACTTC AAGAGCCATT TTGTTGAACA AACTGAGCTA
AAAGGCCAAG CCCATGCAAT TGCTTTGGCC CGCGAGATGG TCTCTGGCCC AACCTTGATT
TTATTCGTCG ATACGATTTT TGAAGCAAAC CTGAATGTTT TGAACCAAAC TGATGCCGAC
GGCGTGATCT ATGTAAGCGA AGTTGAAGAT CCCTCGCGCT TCGGGGTGGC ATTGCTCGAA
GATGGGATTA TTACAAAACT GGTGGAAAAA CCAAGTACGC CAGTTTCCAA TTTGGCCTTG
ATCGGCGCAT ACTATGTGCG TGAAGTCAAA GAATTGTTCG CAGCGATCGA TGTGCTGATC
GAGCAGAATA TTCAAACCAA AGGCGAGTTT TATTTGGCCG ATGCACTCCA ACTCATGATT
AGCAATGGCA CGCGATTTAG TGCCGAAACT GCAACCATGT GGGAAGATTG TGGCACAGCC
CCCGCCTTGT TACGCACCAA TCGCTATTTA TTGCAACACG AAACTGGCAA CGTCGAACAA
CGTGATGGCG CGATCATCGT TCCACCAGTC TTTATTGGCG AGAATGTTGA GATTCGCAAC
TCAATTATTG GGCCATACGT CTCAGTTGCT GACCATAGCG TGATTGTCGA TTCGATTGTG
CGCGATTCGA TTATCAATCA AGGAGCCAGC ATTCAATCAT CAACCTTGGA AGGCTCACTT
ATCGGTGAAG GAGCGCATAT CAAAGGCGAA TTCCAACACC TTAACGTCGG TGATTCATCA
GTTATTACAT TTGGTAGCAC GATTCAGTAA
 
Protein sequence
MNVIIPTAGL GTRLRPHTHT RPKPLVPVAG KAVIGHLLDK LKVLPLDDVV FITGYLGTQI 
EEYVRKNYNF KSHFVEQTEL KGQAHAIALA REMVSGPTLI LFVDTIFEAN LNVLNQTDAD
GVIYVSEVED PSRFGVALLE DGIITKLVEK PSTPVSNLAL IGAYYVREVK ELFAAIDVLI
EQNIQTKGEF YLADALQLMI SNGTRFSAET ATMWEDCGTA PALLRTNRYL LQHETGNVEQ
RDGAIIVPPV FIGENVEIRN SIIGPYVSVA DHSVIVDSIV RDSIINQGAS IQSSTLEGSL
IGEGAHIKGE FQHLNVGDSS VITFGSTIQ