Gene Caul_4598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4598 
Symbol 
ID5902060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4970009 
End bp4971829 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content69% 
IMG OID641565117 
Productglucosamine--fructose-6-phosphate aminotransferase 
Protein accessionYP_001686216 
Protein GI167648553 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0449] Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains 
TIGRFAM ID[TIGR01135] glucosamine--fructose-6-phosphate aminotransferase (isomerizing) 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.510403 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.651872 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCGGCA TCATCGGCAT CGTGGGCAAA GCGCCCGTCT CGGAGCGGCT GATCGACAGC 
CTCAAGCGGC TGGAATATCG CGGCTACGAC TCGGCCGGCG TCGCCGCCGT GGTTGGCGTA
AAGGTCGAGC GCCGGCGCGC CCAGGGCAAG ATCAAGAACC TGGAGGCCTT GCTCGCCGAG
GAGCCGCTGG TTGGACAGAA CGGCATCGGC CACGTCCGCT GGGCCACCCA CGGCGCGCCC
AACCTCAGGA ACGCCCACCC CCACACCGCC GGCCGCGTCA CCCTGGTGCA CAACGGCATC
ATCGAGAACT TCGCCGAGCT GAAGGCCGAG CTGGCCGCCG CCGGCCGCAC CTTCGAGAGC
GACACCGACA CCGAGGTCAT CGCCCACCTG ATCGACGCCG AACTGGCCAC CGGCCTCGAG
CCGCTCGCGG CCTTCAAGAC AACGCTGGAC CGGCTGACCG GCGCCTACGC CCTGGCGGTG
CTGGTCGAGG GCGCCGACAA CCTGATCCTG GGCGCCCGGC GCGGCAGCCC CCTGGTGGTG
GGCGAGGGCG AGGGCGAGAT GTTCCTGGGC TCCGACGCCC TGGCCGTCGG CCCGTTCACC
AACCGGGTGA TCTATCTGGA AGAGGGCGAC TACGTGGCCA TCGACCACGA CAGCGCCCGG
ATCTTCGACG CCTCGGGCGC GCCCGTCACG CGGCCGGTCA AGGTGGTCCC CGCCTCGGCC
GTGATGATGG AAAAGGGCAA CTACCGGCAC TTCATGGAAA AGGAGATCCA TGACCAGCCG
GAGGGCTGCC AGCGCACGAT CTCGGCCTAT GTCGACGCCC TGACCGCCCG CACCGCCATG
CCCGGCGATA TCGACTTCAA GGCGCTGGAA CGCATCCAGA TCGTCGCCTG CGGCACCTCC
TATATCGCCG GCGTCATCGG CAAGTACCTG ATCGAGCAAC TGGCCGACCT GCCGGTCGAC
GTCGAGATCG CCTCGGAGTT CCGCTACCGC CAGCCCGCCC TGCGGCCGGG CTCGCTGGTC
ATCGCCATGT CGCAGTCGGG CGAAACCGCC GACACCCTGG CGGCCCTGCG CTACTGCAAG
GCCAAGGGCA TGAAGAGCGC CGTGGTCGTC AACGCCCAGG AATCCACGAT GGCCCGCGAG
GTCGACGTGG TCTGGCCGAT CCATTGCGGG CCCGAGATCG GCGTCGCCTC CACCAAGGCC
TTCACCGCCC AGGTCAGCGT GATGATCGCC CTGGCCGTCG CCGCCGCCAA GGCGCGCGGG
ACGATCGACG CGGCCGAAGA GCAGCGGATG GTCAAGGTGA TGCTGGAGGC CCCGCGCCTG
ATCGCCGAGG CCATCGGCCT GGAGGACGCC CTCAAGGAGA TCGCCTTCGA CATCGCCAAG
GCCCGCGACG TCCTGTTCCT GGGTCGCGGG CCGATGTCGG CCCTGGCCCT GGAAGGCGCG
CTGAAGCTAA AGGAAATCAG CTACATCCAC GCCGAGGGCT ACGCCGCCGG CGAGCTGAAG
CACGGCCCGA TCGCCCTGGT CGACGACCAG ACCCCGATCA TCATCCTGGC CCCCTATGAC
AGCTATTTCG AGAAGTCGGC CTCGAACATG AGCGAGGTGA TGGCGCGCGG CGGCCAAGTG
GTGTTCATCA CCGACCCGGA AGGCGCCAAG CACGCCCCGG CCGGCGCCCG CGTCGTCGTC
ACCGCCCCGG CCAGCGACCC GCTGGTCTCG ACCCTGGTGA TGTCGGCCCC GATCCAGCTG
CTGGCCTATC ACGTCGCCGT GGTGAAGGGC GCGGACGTCG ATCAGCCCCG CAACCTGGCC
AAGTCGGTGA CAGTGGAGTA G
 
Protein sequence
MCGIIGIVGK APVSERLIDS LKRLEYRGYD SAGVAAVVGV KVERRRAQGK IKNLEALLAE 
EPLVGQNGIG HVRWATHGAP NLRNAHPHTA GRVTLVHNGI IENFAELKAE LAAAGRTFES
DTDTEVIAHL IDAELATGLE PLAAFKTTLD RLTGAYALAV LVEGADNLIL GARRGSPLVV
GEGEGEMFLG SDALAVGPFT NRVIYLEEGD YVAIDHDSAR IFDASGAPVT RPVKVVPASA
VMMEKGNYRH FMEKEIHDQP EGCQRTISAY VDALTARTAM PGDIDFKALE RIQIVACGTS
YIAGVIGKYL IEQLADLPVD VEIASEFRYR QPALRPGSLV IAMSQSGETA DTLAALRYCK
AKGMKSAVVV NAQESTMARE VDVVWPIHCG PEIGVASTKA FTAQVSVMIA LAVAAAKARG
TIDAAEEQRM VKVMLEAPRL IAEAIGLEDA LKEIAFDIAK ARDVLFLGRG PMSALALEGA
LKLKEISYIH AEGYAAGELK HGPIALVDDQ TPIIILAPYD SYFEKSASNM SEVMARGGQV
VFITDPEGAK HAPAGARVVV TAPASDPLVS TLVMSAPIQL LAYHVAVVKG ADVDQPRNLA
KSVTVE