Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1967 |
Symbol | |
ID | 5899422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2108530 |
End bp | 2111319 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641562457 |
Product | TonB-dependent receptor plug |
Protein accession | YP_001683594 |
Protein GI | 167645931 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.559589 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTTCT GGCGCGTCAC GGCTCACGGG ATCTGGGCGA CGGCGCTCCT GGCGGCGGCG CCGAGCCTAG CCCGCGACGT CGAACCGCGC CGCGCGTTCG ATCTGGAGGC CGCGCCGCTG GGCAGGGCCT TGTTGGCGTT CAACCTGGCG ACCGGCGCCC AGGTCATCGT CTCGCGCGAC CTGCTGGTCG GCCGCGTCTC GCGCCCGGTG CGCGGTCAAT ACACCGCCAC CCAGGCCCTG GCGCGCATGC TGACCGGATC GGGTCTGCAT GCGGAGCGCA CGGGGCGCGG CGTCCTGATG ATCCTACCCG ACCGGCCGCC GGCGGCTCCA AGGCCCAGCC CCAGCCCCAG CGCGCCCGAA GCCGTCGCCA TCGCCGAGAT GGAGGGCGTG ACGGTCACCG CGCCCCGCAT CCTGCCGCCC GTCGCTCTGA CCACGCTGAG CGGCGCGCGC CTGGAGCAAC TGGGCGTGGT TGGCATGGAT CAGGTCGGGC GGCTGACGCC GGGCCTCAAT GTCGTCAATC TCTCCCCGGC CGCTGCGGCC TTCTCGATGC GCGGCGTCAC CCAGGCCTCG GGCGACGCCA GCCGCGAGCC GAGAATGGCG GTGCTGCAGG ACGGCATGCC AGCGTCCAAG GAGCGCGGCG CCTATTTCGA GCTCTTCGAT CTTGACCGCA TCGACATCGC CAAGGGCCCT CAGCCCACGC TCTACGGCCG CGGGGCGATG GCGGGGGCGC TCGACGTCAT CCAGCACAAG GCCGATCCGC GCGCCAAGGC CTGGAGCCTT CATGGCGAGG TCGGCGACCA TGGCCAGGGC CTGCTGGACG CCATGGTCAA CCAACCGATC GGCCAGGACC TAGCGGTGCG TGTCGCCGCT CGCTGGCGAG GACGAGACGG CCTGACGCCC AACCTGGCTG GCGGCGACCG CCTGGACTCA GTCGCCACCG ACGCCGCCCG CGTCTCACTG GCCTGGCGTC CCGAGGATGG CGAACGGCTT GACCTGATCG TCAACTACCA GCGGGACCGT CCCACTGGCC GGCCGCTCAA ATCCTTCACC TTCGCGCCCA CCGACCCGGT CACGGGCGCG GCTCTGGGCG ATCTTGACCG CCACACCCCG CCCGCCTTGA GCGCCGTCGA CGAGAACGGC CGCCCCGCGG CGCTCGGCCT GGACCGTACG CTGGGCTCGG TCGCGGTCCT GGCCGCGCGC CGGCCGAGCC CGGGTCTGAA CCTGAGCGCG TCTCTGGGCT ATCGCCGCTT CTACGCCGAC GAGCGCCAGG ACGGCGATGG CCTGTCCCTG CCGGTGATCA GCGTCTTCGA GCAGACCCGG GGCGTGCAGT ACAACGCCGA CCTGCGCCTG GCCTACGACG CCGGCGAGCG TTGGCGGGGT TTCGCAGGCG TCAGCCTGTT CCACGAAAGC GGCACCCAGC GGGCCTATTT CACGATCGAC GAACGCCTGC TCCTGGCGCG CATGGCCGGC GGCCTGGACA CGCCCGTCCC CCAAGGTCTC GATACGCTGA CCCAGCCGGA CTTCGTGGCC GCGCAGCTGC GCCAACTGGC CGCGCGGCGC GGCTTCGACC TCCCGGGCGA TCTGGCCCTG GGCGCGGCCG GCAACCTCAA GTCGGCCTAT GTCGAGCGCA ACCAGAACTT CGCGCGGACC ACCGCCGTCG ATCTCTTCGG CGACCTCACC TTCGCGCCAA GCCCGACCTG GGCGCTTTCG GCCGGCCTGC GCTACAGCCA CGACGCCAAG GCCAGCGCGG TCGCCACCGC CCTGCCCGCC GGCCCGTCGG TGCTGGCGGG AGCCGTCCAG GCCCTTGGCC TTCCCTGGGG GCAGAGGGCC GCCTTGCTCG CCGCGCTCAG CGCGCCAGGC GCGGGCGCCA GCGCCGAGAC TCCCCGCGCC TGGCCCAACT ACGCCTTGGT GTTCCAACCG ACATCGGGAA ATGGAGACAA GGTGTCGAAT GCGCTGTCCG ACGAGGGCTG GAGCGGGCGA CTGGTGGCGC GCTATACGCC ATCGTCGGCG TTCAGCGCTT ACGGCTCCTA CGCCCGGGGG CGACGGCCCA GCGTCCTGGT GGCCGGACCG CCTCTGGTGG CCGAGGGTCC GGCGCGGTTC GGAATCGTGC CCGCCGAGAC GATCGACTCC CTGGAGATCG GAACCTGCGG CGTCGGTTTC GACCAACGCG TGGATCTGAA CCTTGCCGCT TACGCCTATC GCTATGTCAA CTTCCAGACC ACCAAGCTTG TCGATGGTCA ACTGCAGACA CTGAACGCCG GTCGAGCCGA CGCGACCGGT CTGGAGGCCG AGGCGCGGTT CGAGCTGTCG CGCTCAGTCC GGCTGTTCGC GGCCTACGGC TACAACCGTT CAAGACTGCG CAGCGGCGCC TTCGCTGGCA ATCAATTTCG CCTCGCCCCC GATCACAAGG TCTCGCTCAA CCTGGAACTG ACATACGCCA TGCCGGGCGG CGCGCTCACG GTCCTGCCGA CTTGGAGCTG GCGATCGAAG GTCTTCTTCT CCGACGACAA CGATCGCCCC CAACTGTCCC GCGGACTAGT GCGCGATCTG GTGCAAGACG AGGTGCAAGG CAGCTACGAC TTGCTGGATC TGCGGGTGAA CTATCAGCCG CGCGCCGCCA ACTGGAGCGC GGGCGTCTTT GTCACCAATC TCCTGGATCG GCGCTATCTT CAGGAAGCAG GCTTCATCGG CGAAAGCTTC GGATTCACAG CGTCGGCGCA GGGGACGTCG CGTCTCTGGG GGCTATCTTT ACGTATCGTG AGTGGAGCGA GGCAGGACTT CGCGAAATAG
|
Protein sequence | MRFWRVTAHG IWATALLAAA PSLARDVEPR RAFDLEAAPL GRALLAFNLA TGAQVIVSRD LLVGRVSRPV RGQYTATQAL ARMLTGSGLH AERTGRGVLM ILPDRPPAAP RPSPSPSAPE AVAIAEMEGV TVTAPRILPP VALTTLSGAR LEQLGVVGMD QVGRLTPGLN VVNLSPAAAA FSMRGVTQAS GDASREPRMA VLQDGMPASK ERGAYFELFD LDRIDIAKGP QPTLYGRGAM AGALDVIQHK ADPRAKAWSL HGEVGDHGQG LLDAMVNQPI GQDLAVRVAA RWRGRDGLTP NLAGGDRLDS VATDAARVSL AWRPEDGERL DLIVNYQRDR PTGRPLKSFT FAPTDPVTGA ALGDLDRHTP PALSAVDENG RPAALGLDRT LGSVAVLAAR RPSPGLNLSA SLGYRRFYAD ERQDGDGLSL PVISVFEQTR GVQYNADLRL AYDAGERWRG FAGVSLFHES GTQRAYFTID ERLLLARMAG GLDTPVPQGL DTLTQPDFVA AQLRQLAARR GFDLPGDLAL GAAGNLKSAY VERNQNFART TAVDLFGDLT FAPSPTWALS AGLRYSHDAK ASAVATALPA GPSVLAGAVQ ALGLPWGQRA ALLAALSAPG AGASAETPRA WPNYALVFQP TSGNGDKVSN ALSDEGWSGR LVARYTPSSA FSAYGSYARG RRPSVLVAGP PLVAEGPARF GIVPAETIDS LEIGTCGVGF DQRVDLNLAA YAYRYVNFQT TKLVDGQLQT LNAGRADATG LEAEARFELS RSVRLFAAYG YNRSRLRSGA FAGNQFRLAP DHKVSLNLEL TYAMPGGALT VLPTWSWRSK VFFSDDNDRP QLSRGLVRDL VQDEVQGSYD LLDLRVNYQP RAANWSAGVF VTNLLDRRYL QEAGFIGESF GFTASAQGTS RLWGLSLRIV SGARQDFAK
|
| |