Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3951 |
Symbol | |
ID | 5901413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4277860 |
End bp | 4278960 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641564472 |
Product | extracellular solute-binding protein |
Protein accession | YP_001685574 |
Protein GI | 167647911 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0469175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCA ACCGCATTCT CGGCACACGC CGCTCGCTGC TGACCGCCAT GGGCGCGGCG GCCATCGGCA TCAGCTTCAC CGCCTGTGGT CAGAAGCCCA AGGGCGAAGC GACCAAGTCG CCGAACGGCG AAGAGCCCAA GCTGAACTTC TACAACTGGG ACACCTATAT CGGCGAGACG ACCCTGGGCG ACTTCAAGAA GGCCACCGGC GTCGACGTCA ACATGAGCCT GTTCGCCACC AACGACGAGC TGTTCGCCAA GCTGAAGGCG GGCAACGCCG GCTATGACGT GATCGTGCCG TCCAACGAGT TCGTCACCCG CATGGGCCAG GCGGGGATGC TGGAGCCGCT GGACCACGCC AAGATCCCCA ACATCAAGAA CATCGACCCG GCCTTCCTCG ACCCCGACTA CGACAAGGGC CGCAAGTTCT CGATGCCCTA TACCTGGCTG GTGCTGGGCA TCGGCTATCG CAAGTCCAAG GTCAACGGCG TTCCGGACAG CTGGAAGTAC CTGTTCGACA GCGCGCAATA TAACGGCCGC ATCGCCCTGC TGTCGGAAAG CGCCGACCTG ATCCGCCTGG CCGCCAAGTA CAAGGGCCAC AGCGTCAACA ACATCCCGCC CGAGCTGGTC ACCGAGATCG AGAAGATGCT GATCAAGCAG AAGCCCTATG TGAAGGCGTT CCACGACGAC AATGGCCAGG ACATGCTGGT GGCCGGCGAT GTGGACCTGG TGCTGGAATA TAACGGCGAC ATCGCCCAGG TGATGAAGGA CGATCCGGAC ATCGACTTCG TGATCCCCAA GGAAGGGTCG CTGATCAATT CCGACACCCT GTGCATCCCC AAGGGCGCGC CGCGTCCCGA TAACGCCCAC AAGTTCATCA ACTACCTGCT GGACGCCCAG GCCGGGGCCG AGATCTCCAA GACCATCCTC TACCCGACCC CGAACGCGGC CGCCAAGGCG TTGATGCCGC CGGAATATCG CGACAACAAG GTGATCTTCC CGCCGGCCGA CATCATGAGC AAGTGCGAGT ACGGGGCGTT CGAGGGAGCC GAGAAAGCCA GCCTGTACGA GGAAGTCATC ACCCGGGTGC GGGCGGCTTA G
|
Protein sequence | MSTNRILGTR RSLLTAMGAA AIGISFTACG QKPKGEATKS PNGEEPKLNF YNWDTYIGET TLGDFKKATG VDVNMSLFAT NDELFAKLKA GNAGYDVIVP SNEFVTRMGQ AGMLEPLDHA KIPNIKNIDP AFLDPDYDKG RKFSMPYTWL VLGIGYRKSK VNGVPDSWKY LFDSAQYNGR IALLSESADL IRLAAKYKGH SVNNIPPELV TEIEKMLIKQ KPYVKAFHDD NGQDMLVAGD VDLVLEYNGD IAQVMKDDPD IDFVIPKEGS LINSDTLCIP KGAPRPDNAH KFINYLLDAQ AGAEISKTIL YPTPNAAAKA LMPPEYRDNK VIFPPADIMS KCEYGAFEGA EKASLYEEVI TRVRAA
|
| |