Gene Caul_1120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1120 
Symbol 
ID5898575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1186824 
End bp1189652 
Gene Length2829 bp 
Protein Length942 aa 
Translation table11 
GC content66% 
IMG OID641561602 
ProductTonB-dependent receptor 
Protein accessionYP_001682748 
Protein GI167645085 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00264761 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGATGC ACCTGCTCCG GGCCAGCGCC CTGGCCGGCG CGGCGGGCCT GATGATCGCC 
GGACAGGCGT TGGCCCAGAC GACGCCAACG CCGGCTCCGG CCAGCCCAAC CGATGAGACC
GCGGCCGTCG AGGCCCTGGT CGTCACCGGC TCGCGCATCC CCCGCATCGC CACCGAGGGT
CCCGCCCCGG TGACCGTGAT CACCAGCGAC ACGATCAAGG CCGCCGGCTT CACCAGCGTG
CCCGACGTGC TGCGCAGCCT GACCCAGAAC GGCGGCGAGA CCCAGACCCA GCAATCGTCC
AGCGGCGCCG ACTGGACGCC CGGCGCCCAG CAGGTCGATC TGCGCGGTCT TGGCCCCAAC
CACACCCTGG TGTTGGTCAA CGGCCGCCGT ATCGCCGACT TCCCCCTGCC CTTCAACGGC
AAGAGCGCCT TCACCGACAC CTCCTCCATT CCCCTGGGCA TGATCGACCG GATCGAGGTG
CTCAGCGGCA GCGCCTCGGC CGTCTACGGC TCGGACGCCA TCTCGGGCGT GGTCAACTTC
AATCTCAAGA AGAAGGTCGA CGGCACGACG GTCGACATGA CCTTCGGCGG ACTGGAGCAT
GGCGGCGCGG CCAGCCAGCG GCTCAATGTC TCGACCGGCT ATTCCAAGGG CGACTTCGAC
CTGGTGATCG GCGGCGAGTT CGTCAATCAA AAGCCCCTCT GGGCCTATGA CCGCGACATC
CAGGACTCGA CCAAGGACGC CCCCACCGCC GGCGCCCGGA TCGCCCGCCG CGACTTCCTG
CGCATGGATC CCGCGGAGGA CGTCTATGTC GATCCCGGCA AGACCACCTG CGACAGCCTC
AAGACTTTGA ATGGCGGCTC GGTCGAATAC GCCAGCCGTC CGCGCTGGGG CGCCTATGAC
CCCGACACCG ACGACTACGG CCCGGGCTAC TACTGCGGCA GCTACAGCTC GATCGGCTAC
GGCACGATCA TCAGCGAGCG CAAGAGCGCC AATGTCGTGG CCTCGCTGAA CTACGCCAAG
ACCGACACCC TGGCCTTCTT CGCCGACATC TCGGCCGGCT ACAGCCGCAC CCGCCAGTTC
CAGGACGTGC TGTCGTGGAA CTACCAGGAC GCCAACGGCA GCGAAGACGG GATCTTCTAC
AACCAGTTCA CCGGCGCCCT GGATTTCTGG CAGCGCAACT TCACGCCCGA AGAGATGGGC
GGCCTGCACA AGGGCTACAT CACCAACACC TCGCGCACCT TCAGCATCAC CCCGGGCGTC
AAGGGCTCGC TGGGCGACGG CTGGGACTAT GAGGCCTTCT ACAATTTCAG CCAGTACAAG
TCGTCGATCA GCTGGCCCAA GGTGGTCAAT TCCAAGGCCA CGGCCCTGTT CCTGGGTCCG
CAACTGGGCG TGGACGCCGA CAGCGGCTAC GCGATCTTCA ACGCCGATCC GGCCCGGCTC
TACAAGCCTC TGACCACGGC CGAATACGAC TCGATCACCG CCCGCACCAC CTACAAGCCG
GTGGCCCGCC AGCAGGGCGT GTCGTTCCAG GTCAACAAGG CCGATCTGTT CACCCTGCCC
GCCGGTCCGG TCGGCTTCGC CGCCGTCGCC GAATACGGCA AGCAGTCCTA CAAGCTGGGG
CTCGATCCCC TGGCCACTCA GAACTACTAT TACCACCTGC GCGACGCCGA CGGTTCAGGC
TCGCGCGATC ACTGGGGCGC CGGCTACGAG TTCCGCGCCC CGCTGCTGAA GAGCCTCGAG
CTTTCGACCG CCGGCCGCTA CGACAGCTAC AAGTACGGCG GCAACACCAT CGACAAGTTC
ACCTATAACG GCGGACTGGA GTGGCGGCCG GTCAAGTCGC TGCTGGTGCG CGGCGCCTAT
GGCACCGGCT TCCGCGCCCC TGACCTTCAC TACGTGTTCG CCAAGGAGGG CATCAGCCAC
CCGTCGGGCA CCGACTACTA CCGCTGCCGC ACCGAGGAGC CGGACGAGGA TATCGGCGAC
TGCTCCTATG CCGACGAGGG CCTGGTCAAG GTTCGCCTGG GCAACGCCAA GCTGAAGCCC
GAGACCTCGA AGTCGCTGAA CTACGGCGTC GTCTGGTCGC CGGTGCGCAA CTTCGACATC
TCGGTCGACT ACTTCCGGGT TCAGCTGAAC AACGCCGTTC TGGACATGAG CCTGGACAGC
ATTCTGCGCC AGGAAGGCGA CTGCCGCGTC GGCACGACCG CCGCCGGCAC TCCGGTCAGC
ATCACCTCGC CGACCTGCGT CGACGCCCTG GCCCGCGTGG TCCGCAACCC GCTGACCGCG
GCGATCGATC CCGGCGGCAT CTCGACCGTC ACGATCAACC CGATCAACGT CGCCACCGAG
AAGACCAACG GCATTGATGT GGCCGCCCAC TATCGGCTGG CCACCGACGG ACTGGGAACC
TTCGACTTCA GCCTGGCCCA TACCTGGGTC GACAAGCACA CCAGCCGGCA ATATCCGGGC
GACCCGATCG AGAACCAGCT GGCCTATGAC AGCGGCTACG ACGTCCCGCG CACCAAGAGC
AGCGCGGCCA TCAACTGGAA CAAGGACGCC CTGTCGATCG GCCTGCACGG CCAGCGGCTG
GAGCGCCTGC CCAACTACGC CGAGGACGGC TGGATCAAGG CCACCTACCT GGTCAACGCC
ACGATCCAAT ACGAGATCGA TCCGCGCACG CGGGTCAGCC TGGCGATCGA CAACCTGCTG
GACAAGGCCC CGCCCCGCGA CCCGACCTAT TCGGGCTATC CGTACTACGA CACCTCGTGG
TTCGACTCGA CGGGCCGCAG CTACTACCTG CAACTGACGC ACAAGTTCGG CGGCAATAGC
GGGCTGTAG
 
Protein sequence
MKMHLLRASA LAGAAGLMIA GQALAQTTPT PAPASPTDET AAVEALVVTG SRIPRIATEG 
PAPVTVITSD TIKAAGFTSV PDVLRSLTQN GGETQTQQSS SGADWTPGAQ QVDLRGLGPN
HTLVLVNGRR IADFPLPFNG KSAFTDTSSI PLGMIDRIEV LSGSASAVYG SDAISGVVNF
NLKKKVDGTT VDMTFGGLEH GGAASQRLNV STGYSKGDFD LVIGGEFVNQ KPLWAYDRDI
QDSTKDAPTA GARIARRDFL RMDPAEDVYV DPGKTTCDSL KTLNGGSVEY ASRPRWGAYD
PDTDDYGPGY YCGSYSSIGY GTIISERKSA NVVASLNYAK TDTLAFFADI SAGYSRTRQF
QDVLSWNYQD ANGSEDGIFY NQFTGALDFW QRNFTPEEMG GLHKGYITNT SRTFSITPGV
KGSLGDGWDY EAFYNFSQYK SSISWPKVVN SKATALFLGP QLGVDADSGY AIFNADPARL
YKPLTTAEYD SITARTTYKP VARQQGVSFQ VNKADLFTLP AGPVGFAAVA EYGKQSYKLG
LDPLATQNYY YHLRDADGSG SRDHWGAGYE FRAPLLKSLE LSTAGRYDSY KYGGNTIDKF
TYNGGLEWRP VKSLLVRGAY GTGFRAPDLH YVFAKEGISH PSGTDYYRCR TEEPDEDIGD
CSYADEGLVK VRLGNAKLKP ETSKSLNYGV VWSPVRNFDI SVDYFRVQLN NAVLDMSLDS
ILRQEGDCRV GTTAAGTPVS ITSPTCVDAL ARVVRNPLTA AIDPGGISTV TINPINVATE
KTNGIDVAAH YRLATDGLGT FDFSLAHTWV DKHTSRQYPG DPIENQLAYD SGYDVPRTKS
SAAINWNKDA LSIGLHGQRL ERLPNYAEDG WIKATYLVNA TIQYEIDPRT RVSLAIDNLL
DKAPPRDPTY SGYPYYDTSW FDSTGRSYYL QLTHKFGGNS GL