Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0226 |
Symbol | |
ID | 5897500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 242157 |
End bp | 244394 |
Gene Length | 2238 bp |
Protein Length | 745 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641560710 |
Product | TonB-dependent receptor |
Protein accession | YP_001681861 |
Protein GI | 167644198 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.385209 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.613725 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGA TGCTCAAGCT CGCGCTGCTG GCTGGCGCCG CCTGGTCCGC GGCCGCCACC ACCGTGGCCG CTCAAGACAC CGCGCCGGCG CCGACCTCGG ACGGCTTCGC CATTGAACAG GTCGTCGTCA CGGCACGTCG CCGCGAGGAA AGCCTGCAGG ACGTGCCTGT CGCGGTCTCG GCCTTTTCGG CCGCGCGGCT GGAGCGAACC GGCGCTCAGG ACATCACCGA ACTGACGCGC TCGGCGCCCA GCCTGACCAT TCAGGCGGCG CGCGGTTCGA ACTCGACGCT GATCTCGTTC ATTCGCGGCA TCGGTCAGCA AGACCCGTTG TGGGGCTTCG AGCCCGGCGT CGGCCTCTAT GTCGATGACG TCTACATCGC CCGCCCGCAG GCTGCCGTGC TGGACATCTT CGACATTTCC CGCATCGAAG TGCTGCGGGG GCCGCAGGGC ACCCTCTACG GCCGCAACAC CATCGGCGGC GCCATCAAGT ACGTGACCGA CAAGATCGGA TCGGAAAACG AGGCGACGAT CAAGGGGGCC TACGGCTCAT ACAATCAGCG CGAACTGGTC GCCAGCGGCA AGGCGCGCCT CACAGACACC TGGGCGGTGT CCGGCGCCTT GGCCCGCTAT CTGCGCGACG GCTACGGCAA GAACCTCAAC ACCGGGGCCG AGCACTACAA CAAGGACGTC TGGGCCGGCC GGGCCAGCGT CGAATGGCAG CCGACGCAAG ACGTGTTCTT CCGCCTGGCG GGCGATATCA CCCGTGACGA CTCCAACCCC CGCCACGGGC ACCGTGAGAT CGCGCCGATC CCCTCCAGCG TCTACGACAC CAACGCCGGG GCCGGCGACA AGAACAAGGT CGAGGCGCGC GGCGTGTCGT TGCTCGCCCA GTGGGACGTC GACGACCAGT TGACCCTGAA GTCGATCACC GCCTATCGGG CGGGCGAGAC CGACGGCGTC ATCGACTTCG ACAACCTTCC CGGGCCGCTG CTCGACATCC CGGCCGCTTA TCGCGATCAC CAGTTCTCGC AGGAACTGCA GGCGCTGTAC GAAGGCGACA GAATCCATGC CGTGGGCGGC GTCTACTATC TGAGCGCCAC CGCGTCGGGC GCCTTCGACA CCGTGGTGGG GGGCGCCAAC CTGACAACCC TGACCCAGGG TTATGTCGAT ACCGAGAGCG TCTCGGCCTT TGGCGACGTC AGCTACGATC TCACCGATCG CCTGTCGCTC TCGGTCGGGG GGCGCATCAC CCGCGACAAG AAGACCGGCA ATGTCTTCCG GCAACGGTAT CTCGGAATCC GCAGCCCGTT CTTCGGCAAT CCCGCCGCCA TCGCCTTCGA GGCCCCGCGC ACCAACTACA CGCGCACGGC CACGTTCGAG AAGTTCACGC CTCGCGTCAG CGCCAGTTAC AAATTCTCGC CGGACCTGAC GGGCTATGCC TCGTGGGGCA AGGGCTTCAA GTCGGGCGGT TTCGACATGC GCGGCGACAA GGTCGCCTAT CCGGCCACCG ACCAACCCTA CAGCCCCGAG AACGTCGAGA CGGTGGAGCT GGGCCTGAAG GGCTCGCTCA TGGATCGTCG CGTGACCTTC GCCACGGCGG TGTTCGACAC CAACTACAAA GATATGCAGA TCACTACCCA GTTTCCGACC GCCACGCCGG GCGTCGTCGC CTCGGTGGTG GACAATGTCG GCAGCGCCTC GATCCGGGGC TGGGAGCTGG AGAGTTCCGC GGTCATCAGC TCCAGCTTCG TCGCAAACCT GATGCTCAGC TACATCGACG CCAAGTTCGA CCAGTTTCTG GGCTACGTGC CGACCGGCCC GGCCAACGCC AGCTGCCCGA CCCTGCCCGG CTGTGTCGTC GACTTGTCGG CCGTGCGAGC CTTCCAGAAC ACCCCCGAAT GGACCGGTTC GGCCAGCTTC ACCTACACCC ACGACATGGG TTCCAACGGC AAGATATCGT TCACGCCGAC GGCGTCCTAT CGCGGCGCCT ACCAGTTGTT CGAGGCGCCG CAGCCTATCC TCGACCAAGG CGCCTACTGG CTCTATGACG CCAGCCTGGT CTGGACCTCG GCCGATGATC GCTACCAGAT CGGCCTGCAC GGCAAGAACC TGGGCGACGA GGAGTATCGC GTCGGTGGCT ACGATTTCAG CTCCTTCGGC GCCCTGACCG GCAATACGGT GATCGGCTTC TACGGCCCGC CGCGGTCGGT GACCCTGTCG CTGCAAGCCA AGTTCTAG
|
Protein sequence | MNKMLKLALL AGAAWSAAAT TVAAQDTAPA PTSDGFAIEQ VVVTARRREE SLQDVPVAVS AFSAARLERT GAQDITELTR SAPSLTIQAA RGSNSTLISF IRGIGQQDPL WGFEPGVGLY VDDVYIARPQ AAVLDIFDIS RIEVLRGPQG TLYGRNTIGG AIKYVTDKIG SENEATIKGA YGSYNQRELV ASGKARLTDT WAVSGALARY LRDGYGKNLN TGAEHYNKDV WAGRASVEWQ PTQDVFFRLA GDITRDDSNP RHGHREIAPI PSSVYDTNAG AGDKNKVEAR GVSLLAQWDV DDQLTLKSIT AYRAGETDGV IDFDNLPGPL LDIPAAYRDH QFSQELQALY EGDRIHAVGG VYYLSATASG AFDTVVGGAN LTTLTQGYVD TESVSAFGDV SYDLTDRLSL SVGGRITRDK KTGNVFRQRY LGIRSPFFGN PAAIAFEAPR TNYTRTATFE KFTPRVSASY KFSPDLTGYA SWGKGFKSGG FDMRGDKVAY PATDQPYSPE NVETVELGLK GSLMDRRVTF ATAVFDTNYK DMQITTQFPT ATPGVVASVV DNVGSASIRG WELESSAVIS SSFVANLMLS YIDAKFDQFL GYVPTGPANA SCPTLPGCVV DLSAVRAFQN TPEWTGSASF TYTHDMGSNG KISFTPTASY RGAYQLFEAP QPILDQGAYW LYDASLVWTS ADDRYQIGLH GKNLGDEEYR VGGYDFSSFG ALTGNTVIGF YGPPRSVTLS LQAKF
|
| |