Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2662 |
Symbol | |
ID | 5900117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2893965 |
End bp | 2894963 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641563153 |
Product | extracellular solute-binding protein |
Protein accession | YP_001684287 |
Protein GI | 167646624 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.555213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.620572 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTGC TTATCCCTTC TCGCCGCCAC CTGCTGGTCG GCGGTCTCTC CTTCGCCGGC CTATCGACCC TGGCCGCCTG CTCGCCCGGC AAGACCAAAG ACGCGACGTC GCTCAGCGAC CTGACCCTGC GAGCCGCCAC CTATCGCGGC AATCCCGAGT CGTTCTTCAA GGAGGCCGGG GTCGGCGACA CGCCCTACAG GATCGCCCGT TCGGAATTCG CCAGCGGTAA CCTGATCGCC GAGGCCATCA ACGCCGGCGC GCTCGACATC GGCGGCATGA GCGAGACCCC GCCGATCTTC ATCGCCGGCG CGCCGGGCAA CGACGTGCGG CTGGTGGCGG TGCTGCAGGG CGACGTCAAT AACCAGGTCG TGCTGGTCCC CAAGAACAGT CCGGCCAAGA CCTTCGCCGA CCTGAAGGGC AAGAAGATCG GCTATGTGAA GGCCACCACC TCGCACTACA TCCTGCTGCG CCTGCTGAAC GAGGCGGGAC TGAAATGGAC CGATGTCCAG CCCGTGGCCC TGACGCCGCA AGACGGCCTG GCCGCCTTTT CCAGCGGGGC GATCGACGCC TGGATCATCT ATGGCGTCAT CGTCCAGCAG GCCCGGCGGG CCGGGGCGCG AGTGCTGCGC ACGGCGCTGG GCATATTGTC GGGCAACTAC GTGGTCGCCG CCTCGGTCAA GGCGCTGGAC GATGAGGTGC GCCGCCAGGC TCTGGCCGAC TATCTGGGGC GCTACGCCAA GGTCGTCGAT TGGATCAACG CCGACGGCGA GCGCTGGGCC CAGGTCCGCG CCGCCGCCAC CGGCGTGCCG GCCGAGGACT ATCTGCGCGA GTTCCAGGAA CGCAGCGGCC CGCTGAAGCT GGCGCGGGTC ACCCCCGCCG CCATCGCCTC GCAGCAGAGC GTCGCCGACA CCTTCGCCGC CGCCGGGCTC ATTCCCGGCA AGGTCGACGT CGCGCCGCTG TGGGACACCC GCCTGAACAG CGCCCTACCC GATGCTTAA
|
Protein sequence | MTLLIPSRRH LLVGGLSFAG LSTLAACSPG KTKDATSLSD LTLRAATYRG NPESFFKEAG VGDTPYRIAR SEFASGNLIA EAINAGALDI GGMSETPPIF IAGAPGNDVR LVAVLQGDVN NQVVLVPKNS PAKTFADLKG KKIGYVKATT SHYILLRLLN EAGLKWTDVQ PVALTPQDGL AAFSSGAIDA WIIYGVIVQQ ARRAGARVLR TALGILSGNY VVAASVKALD DEVRRQALAD YLGRYAKVVD WINADGERWA QVRAAATGVP AEDYLREFQE RSGPLKLARV TPAAIASQQS VADTFAAAGL IPGKVDVAPL WDTRLNSALP DA
|
| |