Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1672 |
Symbol | |
ID | 5899127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1757297 |
End bp | 1759582 |
Gene Length | 2286 bp |
Protein Length | 761 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641562162 |
Product | TonB-dependent receptor |
Protein accession | YP_001683299 |
Protein GI | 167645636 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.295189 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.636127 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAAGT TCGGTACCCA GAAGATTGGT CTGCTGACAG GGGCGGCCAT TTCGGCGCTG TCGTTCAGCG CCGCCCTCGC GCAGGAAGCG GCTCCGACCG CCCTGGACGA GATCGTCGTC ACGGCCGAGC GCCGCTCGGA AAACCTGCAG AAGGTGCCGG TGTCGGTGGC CGTGGTCGCC GGCGACCAGC TGCGCGCCAT CCAGGGCGGC GGCGACGACA TCCTCTCGCT GTCGGGCAAG GTGCCCAGCT TCTATGCCGA GACCACCACG GGCCGGATCT TCCCGCGCTT CTACATCCGC GGCCTGGGCA ATATCGACTT CTATCTCGGC GCTTCGCAGC CGGTGTCGAT CATCCAGGAC GACGTCGTGC TCGAGCACGT GGTCCTGAAG TCCAACCCGC TGTTCGACGT CAAGCAGGTT GAAGTGCTGC GCGGCCCGCA GGGCTCGCTG TTTGGCCGCA ACACCACGGC CGGCATCGTC AAGTTCGACA CCAACCGTCC GACCGAGACC CTCGAAGGCC GCGCCAGCGC CTCCTACGGC ACCTACGGCA CGACCACCTT CGACGGCGGC ATCGGCGGCC CGATCGCCGG CGACAAGCTG ATGTTCCGCC TGTCGGCGCT GTACCAGCAC CGCGACGACT ATGTCGACAA CACCTTCGCC GGCACGAGCG CCGACGGCAC CGTCACGCCG AAGAAGAACG CCATGGGCGG CTTCGACGAA AAGGACGTGC GCCTGCAGAT CCTGGCCAAG CCGACCGATC AACTGACGGT CCTGGCCTCG GCCCACGCCC GCAACTACGA GGGCACCTCG ACCCTGTTCC TGCGCCAGGC GCTGAAGAAG GGTTCGAACA ACTCGACCGC CCCGCGCGAC AGCGTCGCCC TCGACGAGGG CAACAACAAC CCGCAGGCCT ATGACACCCA GGGCACGTCG CTGAACGTCG CCTACGACTT CGGTCCGGTG ACCCTCACCT CGATCAGCGC CTACGAGACC ACCAGCGGCT ACAGCCGCGG CGACACCGAC GGCGGCGCGG CGGCCAACTA TCCGGTCGGC GGCGTTCCCA ACGGCTTTGG CCAGTCCATG GGCCGGGTTC GCGACCTCGA CCAGCTGACC CAGGAAATCC GCCTGGCCAG CAACGGCGAC GACCGCCTGA AGTGGCAGGT CGGCGCGCTG TACTTCGACT CGCGCGACAC GACCGACTTC TATCAGCGCG CCTACTTCAC CAAGACCAAC CCCAACAACT GGGTCGAGCT GAACAACCTC AACACCTCGT GGGCGGTGTT CGGCCAGGTC AGCTACAAGG TCACCGACGC CCTGACGATC ACGGGCGGCC TGCGCGACAC CTACGACGCC AAGAAGACCA TCCTGGCCAA GACCGCCAAC ACCGCCGCCA ACGCCGTGAC CTATGCCGGT CGCCGCTATG TGCGTCTGTC GGACGAACAG GTCAGCTGGG ACCTGAGCGC CAACTACGAG GTCAATCCCG ACCTGAACCT CTACGCGCGG GCCGCCAAGG GCTTCCGTGG TCCGACCATC CAGGGCCGCT CGGCGGTGTT CAACAGCGAC TTCACGACCG CCAATTCCGA AACGATCCTG TCGTGGGAAG CGGGCTTCAA GAGCACGCTG CTGGACAACA CCCTGCGCCT GAACGCCTCG GCCTTCACCT ATGAGGTCAA GGACATCCAG CTGAACGGCA ACGATTCCAA CGGCAACGGC GTGCTGTTCA ACGCCGACAA GGCCAAGGCC TACGGGCTGG AAGCCGACGC CGAATGGCGT CCGGTCTCGA ACCTGACCCT GACGGCCGGC GTCAGCCTGC TGCACAGCGA GATCCAGGAC AAGCGCGTCT ACGCCCAGGT CTGCGCGCTC AACAGCGTGG TGGTCTGTAC GGTCAAGAAT CCGACGATCG CGATCGTCGG TCCGTTCGGC ACCAGCACCT TCGCCCAGAT CGACGGCCAG CCGCTGCCCA ACGCGCCGAA GTACAACTTC AACTTCACGG CGCGCTACGA CGTGCCGGTC GGGGCCGACG GCAAGCTGTT CATCGCCACC GACTGGAACG TGCAGGGCTA TACGAACTTC GTGCTCTACG ACACCGACGA GTTCTACTCG AAGGGCAACT TCGAAGGCGG CCTGAAGCTC GGCTACGAAG GTGGTAACGG AGCCTATGAA GTGGCCCTGT TTGGCCGCAA CATCACCAAC GAGAAGAACC TCAAGGGCGT GATCGAGAAC TACATGGCGG CCGTCTATAA CGAGCCGCGC ATCGTGGGCA TCTCGGTCAG CGCGAAGCTG AAATAG
|
Protein sequence | MRKFGTQKIG LLTGAAISAL SFSAALAQEA APTALDEIVV TAERRSENLQ KVPVSVAVVA GDQLRAIQGG GDDILSLSGK VPSFYAETTT GRIFPRFYIR GLGNIDFYLG ASQPVSIIQD DVVLEHVVLK SNPLFDVKQV EVLRGPQGSL FGRNTTAGIV KFDTNRPTET LEGRASASYG TYGTTTFDGG IGGPIAGDKL MFRLSALYQH RDDYVDNTFA GTSADGTVTP KKNAMGGFDE KDVRLQILAK PTDQLTVLAS AHARNYEGTS TLFLRQALKK GSNNSTAPRD SVALDEGNNN PQAYDTQGTS LNVAYDFGPV TLTSISAYET TSGYSRGDTD GGAAANYPVG GVPNGFGQSM GRVRDLDQLT QEIRLASNGD DRLKWQVGAL YFDSRDTTDF YQRAYFTKTN PNNWVELNNL NTSWAVFGQV SYKVTDALTI TGGLRDTYDA KKTILAKTAN TAANAVTYAG RRYVRLSDEQ VSWDLSANYE VNPDLNLYAR AAKGFRGPTI QGRSAVFNSD FTTANSETIL SWEAGFKSTL LDNTLRLNAS AFTYEVKDIQ LNGNDSNGNG VLFNADKAKA YGLEADAEWR PVSNLTLTAG VSLLHSEIQD KRVYAQVCAL NSVVVCTVKN PTIAIVGPFG TSTFAQIDGQ PLPNAPKYNF NFTARYDVPV GADGKLFIAT DWNVQGYTNF VLYDTDEFYS KGNFEGGLKL GYEGGNGAYE VALFGRNITN EKNLKGVIEN YMAAVYNEPR IVGISVSAKL K
|
| |