Gene Caul_0292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0292 
Symbol 
ID5897566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp321946 
End bp324729 
Gene Length2784 bp 
Protein Length927 aa 
Translation table11 
GC content64% 
IMG OID641560776 
ProductTonB-dependent receptor 
Protein accessionYP_001681927 
Protein GI167644264 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.73906 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACATT TCCACAAGAC CATTCTAACA GCGTCCGCCA GCGTGCTGGC GGTGGCGATG 
GCCGCGCCGG CCTTCGCGCA GGACAACACC ACGGTCGACG CCGTCGTCGT CACCGGCATT
CGCGCCTCGC TGCAGGCCTC TATCGACTCC AAGCGCAACG CCAACGCCAT CGTCGACGTG
ATCACCGCCG AAGACATCGG CAAGTTCCCG GACAAGAACG TCGCCGAGTC TCTGCAGCGC
GTCCCCGGCG TCACCATCCA GCGCGAATTT GGCGAAGGCG AGCGGATTTC GGTTCGCGGC
ACCGCGCCGA CCCTGAACCG CACCCTGTTG AACGGCCACG CCGTGGCCAC CGCCGACTGG
TTCATCCTCG ACCAGTTCAA GGCCAGCCGC AGCTTCAACT ACCTGATGCT GCCGTCGGAA
ATCGTCGGCA AGGTCGAGGT CTTCAAGAGC CCGGTCGCTG ACATCGACGA AGGTGGCGTC
GGGGCCACCG TGGATGTCCA CACCCGCAAG CCGCTCGACC TGGCGCCGAT CGCGATCTCG
GGTTCGATCC AGGGCGTGTG GGACGAGAAG TCCGACAAGA CGACCCCGAC CGCCTCGGCC
ATGCTCAGCT GGCACAACGA CGCCAAGACC TTCGGCGTCC TGGGCGGGCT GATCTATGAC
AAGCGCGAGA TCCGCCGCGA CGGCATCGAA GTGCTGGGCT ACGAGAACGT CACCAACGCC
TCGGGCAACG TCACGGGCCT GCCCGCCGGC CAGACGGCGG TGACGCCCAG CCTGATCGGT
TCGTCGTACT TCAAGCAGAC CCGCGAGCGC AAAGGCGCGA ACTTCTCGAT CCAGTGGGCC
CCGACCGACA AGTTCGAAGT CGAGCTGACC GGCCTCTATT CCAAGATGGA CGCCGACAAC
ATCAACCAGA ACTACATGGC CTGGATTACC AACAAGATCT CGGGCCTGGG CACGCCCGGC
GCCAATTTCA ACATCCAGAG CCTGAACAAC GGCACCGCGA CCAAGGGTAC GTTCAACGCC
ATCGGCGGCC AGGGCGTGGT GTTCGACGCC ATCGACCGTA TCGCCCACAC CCAGGTCCGC
TCGATCGACC TGAACTCCAC CTGGCGTCCG GCCGAGGGAT GGGAATTCCA CGGACGGGTC
GGCTATACCG ACGCCGAGGG CTCCACCGAC CTGCAGCCGT TCTGGGAAAC CAGCGCCGCC
ACGGGCCTGA CCTACGACCT GTCGGGCGGC CTGCCCAAAG TATCGTTCAG CACCATCAAT
CCGGCCACGG CCGATGACGA GATGGAACTG GGCTGGGCCT CGGCCAACAC CACCGCCAAC
GACGACAACG AGTTCTACAC CTTCGTCGAC GGCGAGAAGT TCATGGACGC CGGCGCCTTC
ACCTCGGTCA AGTTCGGCCT GAAATATACC GACCACGATC GCGATGTGGA CCAGACCTAT
GGCCAGCGCC GCGCCCTGCT GCCGTGGACC GCTCCGGGCG CCACCGCCTG CGGCGGTCAC
CCCTGCTCGC TCGCCGATGT TCAAGGCGGT CTGACGCCCA GCGACTATCT GGACGGCATC
AAGGGTCCCG GCGTCATCAG CAGCTATCTG ACCGCCGACA AGAACAAGAT CGAGGCGATC
TACAACAAGC TGCCCAAGGC GGCGGTGTGG AACTCCGCGA TCGGCGCGAC CCAGGCCGCC
GGCTGTGACG GCCTGCTGAA CTGCGACCAT TTCGGCCCGC TGGAAAGCTT CACGTTCAAC
GAGAAGACCT TTGGCGGCTA TGTCCTGGGC AAGCTCAAGG GCGACAATTG GCGCGGCAAT
GTTGGCCTGC GCATCGTGCA GACCAAGGTC GACACCTCGG CGTGGAAGGT CGGCGTTCCG
GCCGGCACGG CTGGCGCGGT CAACAACCCC TTCGGCCCGA TCGCTCCGGT CGACGCCGGC
CAAAAATATA CGGATATCCT GCCGAGCGCG AACTTCTCGT TCGACGTGCG GGATGACCTG
GTCCTGCGTC TGGCCGCCGG TCGCACCCTG TCGCGTCCGG ACTACGCCCA GATGGCGGCG
TTCACCTCGC TGACCGAGTC CCTGCTGACC GGCAGCGGCG GCAACCCGAA CCTGGATCCC
TACCGCGCCA ACCAGTACGA CGCGTCGCTG GAATGGTACT TCGCGCCGCA GTCGATCCTG
GCGGTCGGCG TCTTCTACAA GGACATCTCG ACCTACATCG TCCAGGGCGC GTTCGTGGAG
CGCCAGCCGC TGCAACTGAA CGATCCGACC GATGCGCGGA TCACCAACGC GGCCAACAAC
TGCGTGCTGA CCGGGACCAA CCTCTACACC TGCAACTACC GCATCGGCCG CCCGATCAAC
GTGGCCGGCG GCGAGAACAA GGGCGTCGAG GTCAACTTCC AGATGCCGCT CTGGAACGGC
TTCGGGGTCC TGGCCAACTA CACCTACTCC GACGCCTCGT CCTCGTCGGG CGTGCCGGTG
CCATCGAACT CCAAGAACAT CTTCAACGTC AGCGGCTACT ACGAGAACGA GCGCTTCAGC
GCCCGCCTCT CGTACAACTA CCGGTCCAAG TTCTTCGTCG ACTACGACGC CGAGCGCGGC
TTCCGGCAGC TGTGGTCCGA CTCGATCAAG TCCCTGGACG CCTCGGCCAG CGTCAATCTG
ACCGAGAACA TCTCGCTCAG CCTCGATGCG ATCAACCTGA CCGACGAGAA GCTGACCGAG
AACTACGACA ACGATCCGAA CCGTCCGGCC CGTCTCTACA AGAATGGCCG CATGGTGTTC
GGCGGCGTCC GCTTCAAGTT CTAG
 
Protein sequence
MRHFHKTILT ASASVLAVAM AAPAFAQDNT TVDAVVVTGI RASLQASIDS KRNANAIVDV 
ITAEDIGKFP DKNVAESLQR VPGVTIQREF GEGERISVRG TAPTLNRTLL NGHAVATADW
FILDQFKASR SFNYLMLPSE IVGKVEVFKS PVADIDEGGV GATVDVHTRK PLDLAPIAIS
GSIQGVWDEK SDKTTPTASA MLSWHNDAKT FGVLGGLIYD KREIRRDGIE VLGYENVTNA
SGNVTGLPAG QTAVTPSLIG SSYFKQTRER KGANFSIQWA PTDKFEVELT GLYSKMDADN
INQNYMAWIT NKISGLGTPG ANFNIQSLNN GTATKGTFNA IGGQGVVFDA IDRIAHTQVR
SIDLNSTWRP AEGWEFHGRV GYTDAEGSTD LQPFWETSAA TGLTYDLSGG LPKVSFSTIN
PATADDEMEL GWASANTTAN DDNEFYTFVD GEKFMDAGAF TSVKFGLKYT DHDRDVDQTY
GQRRALLPWT APGATACGGH PCSLADVQGG LTPSDYLDGI KGPGVISSYL TADKNKIEAI
YNKLPKAAVW NSAIGATQAA GCDGLLNCDH FGPLESFTFN EKTFGGYVLG KLKGDNWRGN
VGLRIVQTKV DTSAWKVGVP AGTAGAVNNP FGPIAPVDAG QKYTDILPSA NFSFDVRDDL
VLRLAAGRTL SRPDYAQMAA FTSLTESLLT GSGGNPNLDP YRANQYDASL EWYFAPQSIL
AVGVFYKDIS TYIVQGAFVE RQPLQLNDPT DARITNAANN CVLTGTNLYT CNYRIGRPIN
VAGGENKGVE VNFQMPLWNG FGVLANYTYS DASSSSGVPV PSNSKNIFNV SGYYENERFS
ARLSYNYRSK FFVDYDAERG FRQLWSDSIK SLDASASVNL TENISLSLDA INLTDEKLTE
NYDNDPNRPA RLYKNGRMVF GGVRFKF