Gene Caul_1754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1754 
Symbol 
ID5899209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1848823 
End bp1850367 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content67% 
IMG OID641562244 
ProductAltronate dehydratase 
Protein accessionYP_001683381 
Protein GI167645718 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2721] Altronate dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACAC GAAAAGTTCC AATGACCAAG GCATTGATAG GCGAACCCAG CTCCAGTCCG 
TCCGAATCGA TCCATGGCGT CGATCCGCGC GATCACGTGG CGACGGCCCT GCGCGACCTC
CTGGCGGGCG AGACCCTGGA CCTGCACGGC CAAACGATCG TCGCCAGCAC CGACATCCCC
AAGGGCCACA AGATCGCCGT GCGCGCCGCG CGCCTGGGCG AGGACCTGCT GAAATACGGC
TGGCCGATCG GCCGGGCCAC GGCCGACATC GCGGTCGGCG ACCATGTCCA TGTGCACAAT
GTCGAGACCC GACTGTCGGG CGTCGAGCAC TTCACCTACA CCGCTGGCCA TCCGGAACCG
CGCCAGGCGC CGCCGCTCGC GACCTTCCTG GGCTACCGAC GCAAGAACGG CCGGGTCGGC
ACGCGCAACG AAATCTGGGT GCTGTGCACG GTAGGCTGCG TGGCCAACAC CGCCCGGCGG
ATCGCGGAGA AGGCCAACGC CCGCTTCGCC GGGCGCATCG ACGGCGTCTT CGCCTTCCCC
CACCCGTTCG GTTGCTCGCA ACTGGGCGAC GACCTGACCC ACACCCGGAA ACTGATCGCC
GGTCTCGCCG CCCATCCCAA CGCCGGCGGC GTGCTGATCC TGGGCCTGGG TTGCGAGAAC
AACCAGCTCA AGGCCCTGCT GGAGTCCGCG CCGGGTCTGG ATCCAGAACG CCTGCGCTCG
TTTACCACCC AGATGGTCGA GGACGAATTG GAAGACGGCC TTGACGCCAT CGAGGCGCTG
GTCGAGATCG CTGAACGCGA TCGTCGCGAG CCCATGCCGT TGTCCGATCT GGTGATCGGC
CTGAAGTGCG GGGGGTCCGA CGGTTTCTCC GGCGTGACCG CCAACCCGCT GGTGGGCAGA
ATCGCCGACA AGGTCTCAAA CGCCGGCGGT ACGCCGGTGC TGACCGAGAT CCCCGAGGTG
TTCGGGGCCG AGGGCGTGCT GCTGGCGCGC GCCGCCACCC GCCAGGTGTT CGACCAGGCC
GTCGGGGTGA TCGACGACTT CAAGCGCTAC TTCATCGACA ACCACCAGCC GATCTACGAG
AACCCCTCGC CCGGCAACAT CGCCGGCGGC ATCACCACCC TGGAGGAGAA ATCCCTGGGC
GCGGTGCAGA AGGGCGGACG CTCCATGCTG GTCGAGGTGC TGCGCTACGG CGAACAGGTT
GGGTCCCATG GCCTAACGTT ACTGGAAGCC CCCGGCAACG ACGCCGTCTC GTCCACCGCC
CTGACGGCGG CGGGGGCGAC GGTGATCCTG TTCACGACCG GCCGCGGCAC GCCGCTCGGC
TTCCCGGCTC CAACCCTAAA GATCGCCTCG AACTCGGCCC TGGCCGCTCG CAAGCCGGGC
TGGATCGACT TCGACGCCGG CGTGGTGCTA TCGGGCCAGA CCATGGACGC GGCGGCCGAC
TCTCTGATGG ATCTGGTGGT GGCCACGGCT TCCGGCCAAC AGACCAAGGC CGAACTCAAC
GGCGAGCGCG AAATCGCCAT CTGGAAAACA GGCGTTACCC TTTGA
 
Protein sequence
MATRKVPMTK ALIGEPSSSP SESIHGVDPR DHVATALRDL LAGETLDLHG QTIVASTDIP 
KGHKIAVRAA RLGEDLLKYG WPIGRATADI AVGDHVHVHN VETRLSGVEH FTYTAGHPEP
RQAPPLATFL GYRRKNGRVG TRNEIWVLCT VGCVANTARR IAEKANARFA GRIDGVFAFP
HPFGCSQLGD DLTHTRKLIA GLAAHPNAGG VLILGLGCEN NQLKALLESA PGLDPERLRS
FTTQMVEDEL EDGLDAIEAL VEIAERDRRE PMPLSDLVIG LKCGGSDGFS GVTANPLVGR
IADKVSNAGG TPVLTEIPEV FGAEGVLLAR AATRQVFDQA VGVIDDFKRY FIDNHQPIYE
NPSPGNIAGG ITTLEEKSLG AVQKGGRSML VEVLRYGEQV GSHGLTLLEA PGNDAVSSTA
LTAAGATVIL FTTGRGTPLG FPAPTLKIAS NSALAARKPG WIDFDAGVVL SGQTMDAAAD
SLMDLVVATA SGQQTKAELN GEREIAIWKT GVTL