Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3991 |
Symbol | |
ID | 5901453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4321004 |
End bp | 4322029 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641564512 |
Product | ApbE family lipoprotein |
Protein accession | YP_001685614 |
Protein GI | 167647951 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.604582 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGCG TCCTCGTCCC CCAGCTCGCC GAGCCGCCCG CCCGCCCGAT CGGCGGTGCG GTGCTGGCGC TCGCCGGCCA GACGATGGGC ACGACCTGGT CGGTCAAGCT GGTGGCGCCG CCGACGGCCA ACGCCGAGGC CCTGACGGCC ATGGCCCAGC GTGAGCTCGA CGCCGTGGTC CGCGAGATGA GCCCGTGGGA GCCGGAGTCC GATCTCTCCC GCTACAACCG CGCCGCCGCC GGGAGCTGGA CCGCCCTGCC CCCGGCCTTC GCCCAGGTGC TGCGCTGCGC CCTGGAGATC GCCGAGGCGA CCGACGGAGC CTTCGATCCG ACGCTGGGCG GCCTGGTCGA CCTCTGGGGT TTCGGCCCCC GCCCCTTCTC CGGCGCGCCG CCGCGAGCCC GAGACATCGC CATCGCTCGC GAGACCGCCG GCTGGCGCCG CCTGGTCCTC GACGGCGACA GCCTGTTGCA ACCCGGCGGC CTGCGCCTGG ACCTCAATGG CGTCGCCAAG GGCTTCGCGG TCGATCAGGT CGCCGCCGCC CTGGGCCGGG CCGGCGCGCG CTCGTACCTG GTCGAGGTGG GCGGCGAGCT GCGCGGGACC GGCGCCAAGC CCGACGGCCA ACCCTGGTGG GTCGAGCTGG AACGCCCGCC GGCCGCGCCC GCGCGCGGAT GCGCGCCTCT TCCGCTAGTT GATGACGGCC CGCGCACCCT GGTCGCCCTG CACGATCTGT CGGCCGCCAC CTCGGGCGAC TACCGCCGGT TCTTCGAGCA CGACGGCCGT CGCTACGCCC ACACCCTGGA CCCCGCCACG GCCGCGCCGG TCACCCATTC GACGGTCAGC GTCACCGTGC TCGACCAGAG CTGCATGCGC GCCGACGCCT ACGCCACCGC CCTGACCGTG ATGGCGCCCG ACGCCGCCCT GGCCTTCGCC GCCGCCCATG GCCTGGCCGC CCTGATCCTC GCCAACGGCG CGCACGGCCT GGAGGAGCGC CTGTCGCCGG CCCTTGAGGC GATGCTCGAC GCATGA
|
Protein sequence | MTRVLVPQLA EPPARPIGGA VLALAGQTMG TTWSVKLVAP PTANAEALTA MAQRELDAVV REMSPWEPES DLSRYNRAAA GSWTALPPAF AQVLRCALEI AEATDGAFDP TLGGLVDLWG FGPRPFSGAP PRARDIAIAR ETAGWRRLVL DGDSLLQPGG LRLDLNGVAK GFAVDQVAAA LGRAGARSYL VEVGGELRGT GAKPDGQPWW VELERPPAAP ARGCAPLPLV DDGPRTLVAL HDLSAATSGD YRRFFEHDGR RYAHTLDPAT AAPVTHSTVS VTVLDQSCMR ADAYATALTV MAPDAALAFA AAHGLAALIL ANGAHGLEER LSPALEAMLD A
|
| |