Gene Caul_1790 details

Gene Information

Locus tag: Caul_1790
Symbol:
ID: 5899245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1891092
End bp: 1893542
Gene Length: 2451 bp
Protein Length: 816 aa
Translation table: 11
GC content: 66%
IMG OID: 641562280
Product: TonB-dependent receptor
Protein accession: YP_001683417
Protein GI: 167645754
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
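
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that the coordinates are 1-based and inclusive and that the stop codon is counted in the gene length but not in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1891092
end_bp = 1893542

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2451 816
```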


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0607055
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAACC AACGGCTTTT GATGGCGGGA TCTTCCGTCG CTGTTCTGCT CGCCCTGTCC 
AGCCAGGCCT TCGCGGCCGA CCCTGTCGCG GCCGATACGT CGGTGGCCGT CGATGAGATC
GTGGTCAAGG CGCGGGACAA GGCGGGTCTG CTGGAGACCC GGCCCAACAA CACGGTGTTC
GGGCTGGACA AACCGCTGCT GGAGACGCCG CGCTCGGCCA GCTTCGTCAG CGACACCACC
CTGCAACGCT ACGGCATCGA GACGATCGAC GGCCTGACCG CCGTCTCGCC CGGCACCTAC
ACCGCCAGCT TCTACGGCGT GCCTGGCGCG CTGAACATCC GCGGCACCCT GGCCGAGAAC
TATTTCCGCG GCTTCAAGCG CATCGAGAAC CGCGGCACCT ATTCGACCCC GATCGGCGCG
GCCGACCAGA TCCAGATCGT CCGCGGCCCG CCGACCCCGA TCTACGGCTC GGGCAAGGTG
GGCGGCATGC TCAACTTCAT TCCCAAGTCA GGGAAGAACG AGGGCGGCTA TCTGTCGGAA
CCGACCGGCG AGGTGACCGC CACCTACGGC TCGTACAACA AGAAGAACGC CACCGCGCAG
CTGGGCCTGC CGGTGAACTT CGGCCCCGTG ACCGGCGGCG TCTACGCCTA TGGCGAGGTC
GAGGACAGCC ACAGCTTCTA CAAGGGCGTC TATCCGCGTC GACAGACCGG CGAGATCTCG
GCCGACTTCG ACCTGGGCAA CGGCTGGAGC ACGGCGTTCG GCGGGATGAA GTACCACTCG
GACGGCGACG TGCAGACGCC GGGCTGGAAC CGCCTGACCC AGAGCCTGAT CGACACCGGG
ACCTACACCA CCGGCCGCGA CACCACCCTG GTCGACAGCG ACCACAACGG CCGCATGACC
CTGAACGAGA TCAGCGGCAA CAGCGCCAAC CCCTACTATT ACGACCCGGC GTTCAACCCG
CTCTACATCC CATACTACAA CTTCTACAAC ACCAACGCCG CCCACGTCCT GGACGCTGGC
GTCGGCGCGA CCAAGCTGTC GCCGCGCACG GTCTATATCA GCCCGGCCGA CTTCTCGAAG
ACCGACACCA ACACCCTCTA TTTCGACCTG GCCAAGACCC TGTCGCCGAG CAGCACGATC
AAGGCCCAGC TGTTCTACGA CGACCAGGAG AACAAGCGCT TCGTCTCGTA CGGCTACCCA
GCCTGGTTCG ACAGCTCGGT CTGGGAAGCG CGCCTGACCT ACAATTTCGA AAACGAGTTC
ATGGACGGGG CGGTCAAGGC CAAGTCGTTC ATCGGGGCCT CATACCGCGA CTTCTCGGGC
CGCCGGCGCG AAAGCTACAA CAGCGGCGTG ATCGCTCTGG ACCGTCGCGA CATCAGCTAC
GGCGCCACGG CCACCGACAT CATCGACAGC CCGTTCACCA CCGAGACCGG TTCTGGCGTT
CTGGGCCTGG CTTGGGAAAA CGACAACAAG TCCGACTGGC AACAGAAGGG CGTGTTCTTC
ATGAGCGACG TCACGGTCGG CGAGAAGCTG AACCTGATGG TCGGCGGTCG CTACGACGAC
TACGACGTCA AGTCGCACGA CACCGGCGTG CTCAGCTACC AGGTGTCGGG CGAGCAGAAG
GCCAGCAAGG GCAAGTTCAC CTACACCGCC AGCGCCACCT ACAAGGCTCC GGCCGGGGTG
ATGCCTTACA TCACCTACGC CAAGGCCTCG GCGCTCGAGA TGAGCCAGGC CGGCGACGTC
GCCGCCAGCC TCGTCGCCGA CCAGAGCGAC GCCTGGCTGT CCAACAGCGA CCTGGCCGAG
GCCGGGGTGA AGTTCCAATG GCTGAAGGGC ACCCTGGTCG GCTCGCTGGC CGGCTACCGC
CAGAACCGCA CCCAGCTGAC CGGCATCAGC GGCACGCCGA CCGGCACCCG CGCCAAGGGC
GTCGAGATGG AAGTCCGCTG GCTGGCCAGC GAGAACTTCA GCTTCACCTT CTCGGGCAAC
ACCCAGCACA CCACGGTCAA GGGGCCGGAC AATTCGTTCC AGTACATCCC GGCCTACACC
GCCGGCGTCC CGGGCTCACA GGCCTTCGGC GGCACCTATG TGGTCTGGGC CTTCAGCGGC
CTGGCGGGCC GCGCGGGCGA CTACGACTAC ACCCTGATCC CCAAGTCGGT GGTCAGCCTG
TACGGCGCCT ATACCAGCGA CGACCACGAC TGGGGCAAGG TCGGCGGCGC GCTGGGCGTC
ACCCACGTGA CCAAGACCTC GGGCACCGTC CAGAACGCCG TGACCTACCC GGCCTACTAC
GTCGCCAACG CCTCGGCCTA CTACGAGTAC GGGCCGTACA CGGTGACGGC CAACATCGAT
AACCTGTTCG ACAAGCTCTA CTTCACGCCC GACGCCGACA GCTACGCCAA CCTTGGCGCG
CTGCCCAGCA AGGGCCGCGA GTGGCGCGTG ACCCTGTCGC GCAAGTTCTA G
 
Protein sequence
MSNQRLLMAG SSVAVLLALS SQAFAADPVA ADTSVAVDEI VVKARDKAGL LETRPNNTVF 
GLDKPLLETP RSASFVSDTT LQRYGIETID GLTAVSPGTY TASFYGVPGA LNIRGTLAEN
YFRGFKRIEN RGTYSTPIGA ADQIQIVRGP PTPIYGSGKV GGMLNFIPKS GKNEGGYLSE
PTGEVTATYG SYNKKNATAQ LGLPVNFGPV TGGVYAYGEV EDSHSFYKGV YPRRQTGEIS
ADFDLGNGWS TAFGGMKYHS DGDVQTPGWN RLTQSLIDTG TYTTGRDTTL VDSDHNGRMT
LNEISGNSAN PYYYDPAFNP LYIPYYNFYN TNAAHVLDAG VGATKLSPRT VYISPADFSK
TDTNTLYFDL AKTLSPSSTI KAQLFYDDQE NKRFVSYGYP AWFDSSVWEA RLTYNFENEF
MDGAVKAKSF IGASYRDFSG RRRESYNSGV IALDRRDISY GATATDIIDS PFTTETGSGV
LGLAWENDNK SDWQQKGVFF MSDVTVGEKL NLMVGGRYDD YDVKSHDTGV LSYQVSGEQK
ASKGKFTYTA SATYKAPAGV MPYITYAKAS ALEMSQAGDV AASLVADQSD AWLSNSDLAE
AGVKFQWLKG TLVGSLAGYR QNRTQLTGIS GTPTGTRAKG VEMEVRWLAS ENFSFTFSGN
TQHTTVKGPD NSFQYIPAYT AGVPGSQAFG GTYVVWAFSG LAGRAGDYDY TLIPKSVVSL
YGAYTSDDHD WGKVGGALGV THVTKTSGTV QNAVTYPAYY VANASAYYEY GPYTVTANID
NLFDKLYFTP DADSYANLGA LPSKGREWRV TLSRKF
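
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch, not the database's own pipeline: it assumes the sequence is read in frame from its first base, and it uses the amino acid assignments of NCBI translation table 11, which match the standard code for elongation (the tables differ only in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
# Standard codon order used by NCBI genetic-code tables: bases T, C, A, G.
BASES = "TCAG"
# Amino acids for translation table 11, one per codon in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First 30 bases and the final line (51 bases) of the gene sequence above.
first = "ATGTCGAACC AACGGCTTTT GATGGCGGGA"
last = "CTGCCCAGCA AGGGCCGCGA GTGGCGCGTG ACCCTGTCGC GCAAGTTCTA G"

print(translate(first))  # MSNQRLLMAG
print(translate(last))   # LPSKGREWRVTLSRKF*
```

The two outputs match the first ten and last sixteen residues of the protein sequence, with the trailing `*` corresponding to the TAG stop codon.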