Gene Information |
Locus tag | Caul_1970 |
Symbol | |
ID | 5899425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2113133 |
End bp | 2116063 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641562460 |
Product | TonB-dependent receptor |
Protein accession | YP_001683597 |
Protein GI | 167645934 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
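The length fields above agree with the coordinates: if Start bp and End bp are inclusive, 2116063 - 2113133 + 1 = 2931 bp, and 2931 / 3 = 977 codons, which leaves 976 aa once the terminal stop codon is dropped. A minimal Python sketch of that check, assuming inclusive coordinates and an unspliced CDS with one stop codon (the function names are illustrative and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(2113133, 2116063)
print(length, protein_length(length))  # 2931 976
```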
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATCTA GAATTTTCGC GCTCTTTGGC GCCGTGTCGA CGCTCGCCTT GAGTCTGGCT CACGCCGGGG TCGCCAACGC CCAGGCCTCA CGGCCAACGG CGGCCGCCGC CACCTCGGTC GAAGAGGTCG TGGTCACCGG ATCGCGCATC GCGCGCCAGG ACTACACCGC CCAAAGCCCG ATCGTCACGA TCGGCAAACA GATTCTCCAG GCCCAGGGGC CGACCTCGAT CGAGAACACC CTCAACCAGT TGCCGCAGTT CGCCGCCACC TCCGGCTCGA CCTCGGCCAG TCAGGCCGGC GGCGGACGGG CCAACGCCAA CCTGCGCGGC CTGGGCACGG CCCGCACCCT GGTGCTGCTG GACGGCCGAC GGCTGCAGCC GTCCGACCCG CTGGGCGCCA TCGACCTCAA CACCCTGGCG CCCAGCCTGA TCGACTCGGT CGAGGTGATC ACCGGCGGCG CCTCGGCGAT CTACGGCTCC GACGCGATCG CCGGCGTCGT CAACTTCAAG CTCAAGCACG ATTTCGAGGG CCTGGAGATC GACGCCCAAT ACGGCCAGAC CGAGCGCAGC GACGGCGAGA CCGCGGATCT GAACGCCACG GTCGGCGGCG CCTTCGCCGA CGACAAGGGT CACGTGATGC TGTCGGCGTC CTATATGGAT CGCCAGCGGG TCAACCGAAA TTCCCGCGCC TTCTTCAAGG ACGGCGGCAT CACCTCGGTG CTGCCCAGCG GCCTGATCTA CGCCGACGCG GCCAACCTGC CGACCCAGGC GGCGATCAAC GGCGTCTTCG CCAAATACGG CGTGACCATC AACGTGCCGC GCAACGCCAC CTACAGCCAG AACCTCGACG GCACGCTGTT CACGATCTCC TCGCCGATCC TCAACTACCG CTTCCCTGCC GACGGCCCCT ACACCATCAC CAGCAACGGC CAGGTCGCCG TGCCGCTGGG CGAGGCTGCG CCCCTGCAGC AGCCTCTCAA GCGCCAAACC CTGTTTGGCC AGGTCAATTA CCAGCTCAAC GACGCCGTCG AGGCCTACGG CCAGTTCAAC TACGCCCACT ACGTCTCCAA CCAGAGCGGC TACGGCCGCA ACCAGGCGAT CACCCGCGAC GTCTATCTGC CGGTCACCAA CCCCTTCCTG CCCGCCGACC TCAAGGCGAT CGCCGCCTCG CGGCCCAACC CGACCGCGCC GCTGCTGTTC TACTTCAACA CCGGCCGCTA CTCGCCTGAC ATCGCCCAGC AGACCTATAA TGTCGCCCAG GCCCTGGGCG GCCTGAATGG CCGGTTCGAC GCGATCGACG GGACCTGGGA CGTGTTCGCC TCCTATGGCC GCACCGAGCA GAACGCGTTG TTGGGCGGCT ATATCGACCG GGCCTCGTTC TTGTCGCTGG TCAACGCGGC CGATGGCGGC AAGAGCCTCT GCGAGGGCGG TCTGGACCCG CTGGCCCTCA CGGCGCCCTC GAAATCGTGC CTGACCTATC TCCTGCGCGA GATGCACGAG ACCACGACGC TGCAGCAGAC CAACGCCGAG GCCAATGTCC AGGGCCGGGT GCTGGCCCTG CCGGCCGGCG ACCTGCGGTT CGCCGCGGGC CTCAGCTATC GCAAGAACAG CTACGCCTAT TCGCCGGACG CCGCGCGCCT CACCGGGACG GTCCTGACCA CGGGTCTGAG CAAGGCTACG GAAGGGGAGA CCAATTCGAA AGAGGTCTTC CTCGAGCTGG CGGTCCCGCT GCTGAAGGAT CTGCCCTTCA TCCGCAAGCT CGACCTGGAT CTGGCCTATC GCTACTCCGA CTATGACACG 
GTCGGCGGCG TTCACACCTA CAAGGCCAGC GGCGCCTGGG AGATCAACGA CGCGGTCAGC CTGCGCGGTG GCTATCAGCA CGCGATCCGC GCGCCTTCGG TGGGCGAACT GTTCCGCCCC TCCGAGCAGA GCGCGACCAC CGTCGGGCGG ATTTCCGCCG GCCTGGGCGA TCCCTGCGAC ATCGCCAGCG CCTATCGCAC CGGCCCCAAC GCCGTCCAGG TGCGCGCTCT GTGCATCGCC AACGGCATTC CGCTCAGCGT GATCGACGCC CACAAGTTCG CCGGCACCTC GGTGCAGTCG GACGTTGCGG GCAATCCCGA CCTGAAGGAA GAGTCTTCTG ACACCTATAC GGCGGGCCTG GTCTGGCGCT CGACCTTCCA GCAGCCCCTG CTGCGCCGGC TCTCGGCCTC GGTCGACTAT TACGACATCA AGCTGACCGA CGCGATCGGC CTGATCACCG GCGACGTGAT CGCCCAACGC TGCTTCAACG GTCAGGGTAA CGCCAACCCG ACCTACGATC CGGCCAACTA CTACTGCAAG CTGATCCATC GCGGCTCTTC GGGCGGCTTC TCCAGCATCA CCACGCCGCT GCTCAACCTG GCCGGCTACC GGACCTCGGG CGTCGACCTT CAGGTCGACT GGGGCGCGGA GCTGGCGGCG TTCGGCGCTT CCGAGCAGGC AGGGTCGGTC AGCCTCAATG TGCTGATCTC CCATATGGAC AAGTACGAGA TCCAGACCCT GGCCGGCGCG GCCTACGTCA ACTACGCCGG CACGATCGGC AACGCCCAGA TCAGCGCCGA CGCCATCTCG CACCCCAAGT GGAAGGCCAT CACCAGCCTG GGCTACCATG TGGGGCCCAT CGACCTGAGC CTGCGCTGGC GCTGGGTCGA GGCCATGGGC AACGCCGCCA ATGTCGGCTC GGCCACCGCC ACCGCGCGGG GCGTCAAGGC GATGGACTAT TTCGACCTGA CGGGACGCTA CCAGGTCAAC TCCACCGTCG AGCTAAGGGC CGGGGCGCTC AATCTGGCCG ACCGCCAGCC CCCGGCCTGG ACCGGCGAGA GCGCCACCGA CACCGCCCTC TACGACATGC TGGGCCGCCG CTACTTCCTG GGCGTCAACT TCAAGTTCTG A
|
Protein sequence | MQSRIFALFG AVSTLALSLA HAGVANAQAS RPTAAAATSV EEVVVTGSRI ARQDYTAQSP IVTIGKQILQ AQGPTSIENT LNQLPQFAAT SGSTSASQAG GGRANANLRG LGTARTLVLL DGRRLQPSDP LGAIDLNTLA PSLIDSVEVI TGGASAIYGS DAIAGVVNFK LKHDFEGLEI DAQYGQTERS DGETADLNAT VGGAFADDKG HVMLSASYMD RQRVNRNSRA FFKDGGITSV LPSGLIYADA ANLPTQAAIN GVFAKYGVTI NVPRNATYSQ NLDGTLFTIS SPILNYRFPA DGPYTITSNG QVAVPLGEAA PLQQPLKRQT LFGQVNYQLN DAVEAYGQFN YAHYVSNQSG YGRNQAITRD VYLPVTNPFL PADLKAIAAS RPNPTAPLLF YFNTGRYSPD IAQQTYNVAQ ALGGLNGRFD AIDGTWDVFA SYGRTEQNAL LGGYIDRASF LSLVNAADGG KSLCEGGLDP LALTAPSKSC LTYLLREMHE TTTLQQTNAE ANVQGRVLAL PAGDLRFAAG LSYRKNSYAY SPDAARLTGT VLTTGLSKAT EGETNSKEVF LELAVPLLKD LPFIRKLDLD LAYRYSDYDT VGGVHTYKAS GAWEINDAVS LRGGYQHAIR APSVGELFRP SEQSATTVGR ISAGLGDPCD IASAYRTGPN AVQVRALCIA NGIPLSVIDA HKFAGTSVQS DVAGNPDLKE ESSDTYTAGL VWRSTFQQPL LRRLSASVDY YDIKLTDAIG LITGDVIAQR CFNGQGNANP TYDPANYYCK LIHRGSSGGF SSITTPLLNL AGYRTSGVDL QVDWGAELAA FGASEQAGSV SLNVLISHMD KYEIQTLAGA AYVNYAGTIG NAQISADAIS HPKWKAITSL GYHVGPIDLS LRWRWVEAMG NAANVGSATA TARGVKAMDY FDLTGRYQVN STVELRAGAL NLADRQPPAW TGESATDTAL YDMLGRRYFL GVNFKF
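The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. A minimal Python sketch, assuming the sequence text has been copied into a string; the helper names are hypothetical, and the stop-codon set is the usual TAA/TAG/TGA used by translation table 11:

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in {"TAA", "TAG", "TGA"}


# Only the first two blocks of the gene sequence, as a small demonstration.
demo = clean_sequence("ATGCAATCTA GAATTTTCGC")
print(len(demo), gc_percent(demo))
```

Running `gc_percent` and `looks_like_complete_cds` on the full cleaned gene sequence is one way to cross-check the GC content and Gene Length fields in the record.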