Gene Arth_1162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1162 
Symbol 
ID4446328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1261533 
End bp1262690 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content69% 
IMG OID639688969 
Productgalactose-1-phosphate uridylyltransferase 
Protein accessionYP_830656 
Protein GI116669723 
COG category[C] Energy production and conversion 
COG ID[COG1085] Galactose-1-phosphate uridylyltransferase 
TIGRFAM ID[TIGR00209] galactose-1-phosphate uridylyltransferase, family 1 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCACA TCACCAGCAC CAACCTTGCC GATGGCCGGG AGCTGCTCTA TTTCGATGAC 
GCTACGGCAG GGGCCCCCGG GAGCGGAAAC ACCCGCCGGC CGGAGGAGAC TCCGGACCGC
CGCGAACTGC CGGCCCGGAG CGAGCCGGGC GAAGTCAGGT TCGACGCCCT CACCGGCGAG
TGGGTGGCCG TGGCTGCCCA CCGCCAGAGC CGCACCCACC TTCCCCCGGC GGATCAGTGC
CCCATCTGCC CGACGACCCC GGCCAACCCC TCGGAGATTC CGGCACCGGA CTACGACGTG
GTCGTTTTCG AGAACCGCTT CCCCTCGCTC GGGCCCGCCC TCGGGCCGGT TCCCGCCGAC
GCCGGCTGGG GAACCACCGG CACGGCGTTC GGCCGGTGCG AGGTGGTGTC CTTCACCCCG
GAGCACACCG GTTCCTTCAG CGGGCTGAGC GAAGTCCGGG CACGTACCGT CGTCGAGGCC
TGGGCACACC GCACCGGGGC CCTCAACGCG CTTCCGGGCA TCCGGCAGGT CTTTCCGTTC
GAGAACCGCG GCGCGGACAT CGGCGTCACG CTTCACCACC CGCATGGCCA GATCTACGCA
TACCCCTACG TCACGCCCCG CGCCGCGGCG ATGGGCGCCG CGGCAAGGAA GTTCTACGAC
GACGCCGACG GCCGCCAGAC GCTGACCGGC TCGCTGTTGC GGTCTGAACG TGAAGACGGC
AGCCGAATGG TCCTCGAAGG CGAGAACTTC AGCGCCTACG TCCCGTTCGC GGCCCGCTGG
CCGCTGGAAG TGCACCTGGT CCCGCACCGC CAGGTACCGG ACCTCGCGGC GCTCAGCGGC
GAGGAGAAAG ACGAGCTGGC GCACGTGTAC CTGGACCTGC TCAAGCGTCT CGATGCGCTC
TATCCGACGC CGACCCCCTA TATTTCGGCC TGGCACCAGG CCCCGCTCGA CGACCTGCTC
CGCCCGGCCG GCTACCTCCA CCTCCAGCTG ACCTCCCCGC GGAGGGCCGA CGATAAGCTC
AAGTACCTGG CCGGTTCCGA AGCGGCCATG GGTGCTTTCA TTAACGACAC CACCCCGGAA
CTCGTGGCGG AGCGGCTGCG CACCGTTACG GTTCCGGCAT CAGTCCCCAA GCCACTCCCG
GAAGGCGCAC ACGCATGA
 
Protein sequence
MTHITSTNLA DGRELLYFDD ATAGAPGSGN TRRPEETPDR RELPARSEPG EVRFDALTGE 
WVAVAAHRQS RTHLPPADQC PICPTTPANP SEIPAPDYDV VVFENRFPSL GPALGPVPAD
AGWGTTGTAF GRCEVVSFTP EHTGSFSGLS EVRARTVVEA WAHRTGALNA LPGIRQVFPF
ENRGADIGVT LHHPHGQIYA YPYVTPRAAA MGAAARKFYD DADGRQTLTG SLLRSEREDG
SRMVLEGENF SAYVPFAARW PLEVHLVPHR QVPDLAALSG EEKDELAHVY LDLLKRLDAL
YPTPTPYISA WHQAPLDDLL RPAGYLHLQL TSPRRADDKL KYLAGSEAAM GAFINDTTPE
LVAERLRTVT VPASVPKPLP EGAHA