Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2126 |
Symbol | |
ID | 5899581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2292028 |
End bp | 2294988 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641562615 |
Product | TonB-dependent receptor |
Protein accession | YP_001683752 |
Protein GI | 167646089 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.593846 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.258054 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTTCC GCCGCGTGTC CGCCCTGACG ATCGCGGTGC TGGCAGTGAC CGCGCCGGCG TTGGCGCAAG GTCCGGCGCC GCGTTTTGAC ATTCCGGCGC AGGACGCCCG CGCCGGCCTG ATGGCCCTGT GCTTGAAGGC CGGCTGCGCT TTCGCCTTTT CGACCGAGCC GGGCCGCACC TACCGCGCCA ACGCCGTCGC CGGGACCATG TCGTGGCAAG AGGCTCTGAA GCGCCTGCTG GCGGGCACGG GCCTACGCTA CGAGATGGCA GACCATGCCT CCGTGCGGGT GTGGGCCGAC GCCACCCCCG CATCACCGCG TGTCGCGCCG ACGCCTGAGG CGCCCGTCGA CCTGGACGCG GTCACGATCA CCGCCGCCTT CGTGGCCGGC ATTGAGGATT CCCTGCTCCA GAAGCGTCGC GCCGACGCCA TCGTTGACGC CATCTCGGCC GGCCGCATCG GCGAGCTTCC GACCGCCAAC CTCGCCGAGG CCCTGCAACG CGTGCCCGGC GTTGCGATCG AGCGTGAGGT GGGCGAGGGG CAGTTCGTCA GCGTCCGGGG CCTGGGGCCG CTGTTCCAGT CGGTGACCCT GAACGGCGCG CCGGTGGCGT TCAACGAGAA CATCCGCAAC TCTACCCAGA GCGGTCGCCA ATTCCGCTTC CGCGCCCTCT CGGCCGACCT GCTGGCCGGG GCCGTGGTGG CCAAGTCCGC GACCGCCGAC ATCGTCGATG GCGGCATCGG TGCGAACATC GACATCCGCA CGGTGCGCGG GTTGGAGGGA GCGTCTTACT TGTCGTTTCG CGCGGACGCC CATGCCGAGG CGCGCTCGGG AGCCGTCTCG CCGGACCTGG CCGTCTCGGG CCGCTGGCGG CGGGTGGACG GGCGGCTGGG CGTGGTCGGC GGTCTCTCCA CCGAGCGCCG CGAAGTGCAG TACGATCGCC TCCAGATCCA GCGCTATCGC AACGTCGTCA TGAACGGCCA GGTGTTGGCG GTTCCCGACG ACGTGCGCAC CACGGTCGAA CAGGAGCAGC GGGCCCGCGC CACGGCCTTC GTCGGCGTCG AGTGGCGGGT AGCGCCCACG GCCAGCCTCT ATTTCGACGT CCTGGCCTCG CGCTTCGACA ACGCCATTCG GGAGGACCGC ATCGTCTATA CGATTGGAGA CTACGCGACC TCGGCCCTGG CCGAACCCCG GGTCGTGGAG GGCTCCCTGG TAGGTGGACG GATCACGGCT GGCCAGATCA GCAACAATCT CGAGGTTTCC GACCAGGTCC ACGACAACGT CTCCCTCAGC CTGGCTATGA AGGCCTTGGT CGGAGACTGG CGTCTTGAGC CGCGCCTCAG CGTCTCGAAC GCGGACTCAA ACCTCGACAC GCCGTTGCAA CGGATCGGGG CTGTGAGTCC GTTGGGCGTG TCCTATGACT TCGACCTGGG GCCGGATCTC GTGCGCGGCC GCGAGGCGCC GCGGCTGGCG ACCAGTTTCG ACCTGACCGA CCCGCATCAG TTGACCTTTT CGCGCTACGG CGTGCGCGCC ACCCAGGTCG AGGACCACGA CTCCACTGGC CTGATCGCCG CCGAGCGGCC AGTCGAATGG AGGCTTGGGC CGCTGCGCAT CGAGCGCCTG CGGCTGGGCG GCCAGGTCAG CGACCGCAGG CGCGACTATC AGCGCCGCGA CCGGGACGCC ACGCTTCGAC CCGGGGCCGC CGTTGATCCG GGCTTCTTCG GCGTGCTCGC GCCCGAGGAC GGCTTCGACC GCCTGGTGGC CGACCGGCCG CCGGCCTGGA CGGCCGCCGA TTTCTCGGCC TTCCGCGCGG CCTTCGTGTT GGCGGGGGAG GCGGACAGCG TGATCGTCGA CGCAGCCGAC CTCAAGCCCG CGGGCGCCGA TCTGCAGGGA TCCTACAAGG TCGGCGAACG GATCCTGGCC GGTTATGGAC GCCTGGATTT TTCAACCACG GTGCTGGGGC GACCGGCCAG TGGCAATGTC GGCGTGCGCA CCGCGCGGAC CGCGACGGAC GTTGCTGGGT CGCGACTGGG CGTTTCGGCG ACTGGGCAGC TTGAAGTCAC GCCGGTCGAC TATGACGGCT CCCAAGCGGT GACCCTGCCC AGCGCCAATC TGGCTATCGA CCTGAACGAG CGCTGGCGGC TACGTCTGGC CGCCTCGCGC AGCATCACTC GGCCATCCCT GGCGGACCTG CGTTCGGCCA CCGTGCCGGC CAGCAGCCTT GTCTCGATCC TCTATGAGCG CGGCCAGATG GAGATCGACC ATCCGTCGGA GGGGACCCTG TTTTCCGGCG TCGGCGGCAA TCCGGCGCTC AAGCCGTATC TGGCGACCAA CTACGACCTG TCGCTCGAGC GCGAGTTCGA GAATTTTGGC GGCGTCAGCC TGGCGGCTTT CCACAAGACC ATCGACGACT TCATCGTCGT CTCGGCCCGG CCCGAGCGCC TGGCGTTTGA CACGCGCAGC GGTCCGCCGG TGACGGCGCT GGTCATGATG TCCCGCCCCC ATAACGCCGG CGAGGCGCGC GTCACTGGCG TCGAGGCCGC CTTCAGCCGA CGATTTCCCG CTGGCTTGGG CGTCTGGGCC AGTGCGACCC TGGTTGACGC CTGGAGCCGC GACGCGCTTG GCCAGCGCGG CCGTTTGAAT GGAGTCTCGC GCCTCTCCTA TTCGATCAGC CCCTTCCTGG AACACGGCCC TTTGCACGCG CATCTGTCCT GGACCTGGCG CTCGCCCTTC GGCTCCGAGG CCGACATGCA AGGCGGCGGG GTGTCCAGCT TCGTTGTCGC CAGCACTGGC TACCTCGACG CCGCCGGATC CTATGACCTT ACGTCCCATG TCTCCCTCTT TGTGCAGGCC AGCAATCTGA CCGACACCAT CGAGGCGGCC TACGAGGGCC AGCGCAGCCG CCCGCTCCAG ATTGGCCGCT CCGGCCGGTC GTTTGGGCTC GGCGTGAGGA TAAGGGGCTA G
|
Protein sequence | MSFRRVSALT IAVLAVTAPA LAQGPAPRFD IPAQDARAGL MALCLKAGCA FAFSTEPGRT YRANAVAGTM SWQEALKRLL AGTGLRYEMA DHASVRVWAD ATPASPRVAP TPEAPVDLDA VTITAAFVAG IEDSLLQKRR ADAIVDAISA GRIGELPTAN LAEALQRVPG VAIEREVGEG QFVSVRGLGP LFQSVTLNGA PVAFNENIRN STQSGRQFRF RALSADLLAG AVVAKSATAD IVDGGIGANI DIRTVRGLEG ASYLSFRADA HAEARSGAVS PDLAVSGRWR RVDGRLGVVG GLSTERREVQ YDRLQIQRYR NVVMNGQVLA VPDDVRTTVE QEQRARATAF VGVEWRVAPT ASLYFDVLAS RFDNAIREDR IVYTIGDYAT SALAEPRVVE GSLVGGRITA GQISNNLEVS DQVHDNVSLS LAMKALVGDW RLEPRLSVSN ADSNLDTPLQ RIGAVSPLGV SYDFDLGPDL VRGREAPRLA TSFDLTDPHQ LTFSRYGVRA TQVEDHDSTG LIAAERPVEW RLGPLRIERL RLGGQVSDRR RDYQRRDRDA TLRPGAAVDP GFFGVLAPED GFDRLVADRP PAWTAADFSA FRAAFVLAGE ADSVIVDAAD LKPAGADLQG SYKVGERILA GYGRLDFSTT VLGRPASGNV GVRTARTATD VAGSRLGVSA TGQLEVTPVD YDGSQAVTLP SANLAIDLNE RWRLRLAASR SITRPSLADL RSATVPASSL VSILYERGQM EIDHPSEGTL FSGVGGNPAL KPYLATNYDL SLEREFENFG GVSLAAFHKT IDDFIVVSAR PERLAFDTRS GPPVTALVMM SRPHNAGEAR VTGVEAAFSR RFPAGLGVWA SATLVDAWSR DALGQRGRLN GVSRLSYSIS PFLEHGPLHA HLSWTWRSPF GSEADMQGGG VSSFVVASTG YLDAAGSYDL TSHVSLFVQA SNLTDTIEAA YEGQRSRPLQ IGRSGRSFGL GVRIRG
|
| |