Gene Haur_2603 details


Gene Information

Locus tag: Haur_2603
Symbol:
ID: 5734481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3343584
End bp: 3344642
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 51%
IMG OID: 641279743
Product: UDP-N-acetylglucosamine 2-epimerase
Protein accession: YP_001545369
Protein GI: 159899122
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0381] UDP-N-acetylglucosamine 2-epimerase
TIGRFAM ID: [TIGR00236] UDP-N-acetylglucosamine 2-epimerase
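
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. Both are assumptions about how the record is reported, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3343584
end_bp = 3344642

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and yields no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1059
print(gene_length % 3)  # 0, the gene is a whole number of codons
print(protein_length)   # 352
```

Under these assumptions the result agrees with the reported gene length of 1059 bp and protein length of 352 aa.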


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.22182
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACATTG GGATTGTGCT TGGCACGCGG CCTGAGGTGA TGAAAAATTA TGCGATTGTG 
CAGGCATTAC GCGCGGCTGA TTTGCCGTTT GTGGTGCTTC ACACCAATCA GCATCATGAT
CATTTGCTGC AAACCGCGAT TTTTGGCCAA ATGGGCTACA TGCCCGACGA AGTTTTCCCG
GGCAACTACA GCATCGGCGC AGCGATTGAT TGGGTGCGCG AGCAAATTCG CCGCCATGAC
ATCGATTTGA TTTTGGTCAA TGGCGATACT GCGGCGGCCT TGGTTGGGGC AATTGCGGCA
GTCTACTCCG ATGTTGGGTT GGCCCATGTT GAAGCAGGTC TACGAGCTTT CGATAAACGC
ATGTATGAAG AGCGCAATCG GATTATGGTC GATGGCGCAG CCCATTATTT GTTCTCATAC
ACCCAATATC AAGCCGATTA TTTGGCCAAA ATTCCCGATT TGCGTGGGCG AATTTTCAAT
ATTGGCAATA CCACGGTTGA CTTGATTCAT GATTTTGCCC ATGAACTCAC GCCACGCCGC
AACGATACTT ATGCCTACAT CACCTTGCAT CGCAAGGAAT TTACCGATAG CCGCGAATTG
ATGCAACAGG TTTTCAGCAC AATCAATGAG CTGGCCCAAG AATTCGATGC CATGATTTTT
CCGATGCATC CGCGCACGCG GGCGGCCATG GAGCACTATG GTTTGAGCAT GGATCTGCTC
AGTCGGGTGC AGGTACTTGA TCCAGTTGAG CCATTTGAAT CGCTGGCCTA TGAAAAATAC
GCCAACATTA TCATCACTGA TAGTGGTTGT ATTCAAGAAG AAGCTTATAT TTTTGGCGTG
CCCTGTGTGA CGGTACGCGA GAATACCGAG CGGCCTGAAA CGATCGATTC GGGCGCGAAT
GTGGTCACGG GCTTCGAGCC AACCGCAATT ATCGCGGCGG TGCGCAATCA GCGAGCCAAA
AAAGGCCAGC AATTCTCCCC AGTTTACGGC GAACGTGGGG TTGGCCAACG GATCGTAGCA
ACCTTGCAAG CGCATTTTCG CAGTTGGTCG GATTACTAA
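
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record (six blocks of 10 bases).
first_line = "ATGAACATTG GGATTGTGCT TGGCACGCGG CCTGAGGTGA TGAAAAATTA TGCGATTGTG"

print(len(clean_sequence(first_line)))  # 60
print(f"{100 * gc_fraction(first_line):.0f}% GC in the first 60 bases")
```

Passing the full 1059 bp sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the record.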
 
Protein sequence
MNIGIVLGTR PEVMKNYAIV QALRAADLPF VVLHTNQHHD HLLQTAIFGQ MGYMPDEVFP 
GNYSIGAAID WVREQIRRHD IDLILVNGDT AAALVGAIAA VYSDVGLAHV EAGLRAFDKR
MYEERNRIMV DGAAHYLFSY TQYQADYLAK IPDLRGRIFN IGNTTVDLIH DFAHELTPRR
NDTYAYITLH RKEFTDSREL MQQVFSTINE LAQEFDAMIF PMHPRTRAAM EHYGLSMDLL
SRVQVLDPVE PFESLAYEKY ANIIITDSGC IQEEAYIFGV PCVTVRENTE RPETIDSGAN
VVTGFEPTAI IAAVRNQRAK KGQQFSPVYG ERGVGQRIVA TLQAHFRSWS DY
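
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino-acid string that NCBI publishes for translation table 11, which assigns the same amino acids as the standard code and differs only in which codons may act as start codons. The function name `translate` is hypothetical. The sketch does not handle alternative start codons, which is not an issue here because this gene begins with ATG.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and last nine bases of the gene sequence in this record.
print(translate("ATGAACATTG GGATTGTGCT TGGCACGCGG"))  # MNIGIVLGTR
print(translate("GATTACTAA"))                         # DY
```

The two outputs match the first ten and last two residues of the protein sequence shown above. Translating the full gene sequence the same way allows the remaining residues to be compared.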