Gene Caul_3415 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3415 
Symbol 
ID5900870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3687726 
End bp3690104 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content68% 
IMG OID641563921 
ProductTonB-dependent receptor 
Protein accessionYP_001685040 
Protein GI167647377 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.151131 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGAC ACCTGATGTT GGGCGGAGCC GCCTGCGCTC TGCTGCTGGC CACGCCGATG 
CTGGCCATGG CCGCCGAACC CGCGGCCGCG CCGGCCCCCG CCGACACCGC TCCGGCCGAC
GCCAACGAAG TCGACGCCGT CATCGTGCTG GGCCAGGGCC AGAGCCGCCA GGTCCAGACG
ATCACCACCA CGCAGATCGA AGTCCTGGCC GCCGGCTCCA GCCCGCTGCG CGCCATCGAG
AAACTGCCAG GCGTCAGCTT CCAGTCGGCC GACGCGTTCG GCGCCTATGA GTGGTCGACC
CGCGTCGCCA TCCGCGGCTT CGACCAGAGC CGCCTGGGCT TCACGCTGGA CGGCGTGCCG
CTGGGCGACA TGAGCTATGG CAACTACAAC GGCCTGCACA TCAGCCGCGC CGTCACCAGC
GAGGATCTCG GCTCGGTCGA GCTGGCCCAG GGCGCCGGCT CGCTGTCGAC CGCCTCGACC
TCGAACCTCG GCGGCACGCT GCGGTTCTTC ACCCGCGACC CGTCCAAGGA CTTCCACGTC
CAGACCAACG CCACGGTCGG TTCGGACAAC ATGTACCGGG TGTTCGGCCG TATCGACACT
GGCGCCATCG AGGGCCTGGG CGGCCTGCGC GCCTACCTCT CGGCCGTCGA CCAGAAGGCC
GACAAGTGGA AGGGCGGCAG CGAGCAGAAG CAGCGCCAGT ACGACGCCAA GGTCGTGCTG
CCCCTGGGCG AGACCGGTTC GCTGACCGCG TTCTGGAACC ACTCCGAGCG CCGCGAGCAG
GACTACCAGG ACATGTCGTT CGACATGATC AAGCGCCTGG GCCGCGACTG GGACAACTTC
CAGCCCGACT GGAACAAGGC GGTCGGCGTG GGCGCGGTGC TGAACAACCC GGCCAACTAC
GCCGGCGCGA CCCCGATCCT GAACGGCGGC TACTGGACCG GCGTCGGCAC GAACCCCTAC
GCCCAGTACG GCGTGGCCAC GCCTGACGAC GCCTATTACG CCGGGGCCGG CATCCGCGAC
GACGACATCT ACGCCGTCAC GCTCAAGCAT GACCTGGGCG AAACCGCCCG CTTCGACCTG
ACCGGTTACG GCCACAAGAA CCAGGGCCAG GGCCTGTGGT ACACGCCCTA CACGGTCAGC
CCCGGCTACG GCACGGCCGG CTCGACCGCC GCCCCGCTGT CGATCCGCAC CACCGAGTAC
GACATCGACC GCAAGGGCCT GATCGGCAAC CTCTATGTCG ACCTGGGCAG CCATGCCCTG
AGCGCCGGCT TCTGGACCGA AGACAACGAC TTCCACCAGG CCCGCCGCTT CTACGCCGAG
ACGGCCGCCG CCCCGTCGCG CGACCCGCTG GACTTCCAGT CCAACCCGTT CAAGACGTCG
TGGGAATTCA AGTTCAACAC CAAGACCACG ACCGGCTACC TGCAGGACGT GTGGACGGTG
AACGACGCCC TGAAGGTCAA TTTCGGCTTC AAGGCGCTGA AGGTCGAGAA CCAGGTGACC
TCGATCGTCG GGTCGGTGAA CGGCAAGATC CAGTCCAAGG ACAGCTTCCT GCCTCAGGTC
GGCGCCGTCT ATAGCCTGAA CAACGGCGTC GAGCTGTTCG GCGGCTATAC CGAGAACATG
GCCGCCTACG TCTCGGCCGC CACCTCGGGC CCGTTCGCCT CGCAGAACCA GGCGGCGGTG
AACTTCATCC GCGACAACCT CAAGCCGGAA ACCTCCAAGA CCTTCGAAGG CGGCGTCCGC
TACAAGACCG ACCGCTTCCA GGGCGTGGCG GCGGTCTATC ACGTCGACTT CGAGAACCGC
CTGGACAGCG CCAGCACGGC TCCGCCGATC CTGGGCCTGC CGGCGGTTCT GTCGAACGTC
GGCGCGGTCA AGACCAAGGG CATCGAACTG GCCGGCCAGT ACCGCCTGAC CGACGCCTGG
TCGCTGTACG GCTCCTACGC CTACAACGAC TCCACCTACA AGGACGACGT CCCCGGCGCC
GGCGGCACGG TGGCCATGCG CACCAAGGGC AAGGACGTGA TCTACACGCC CAAGAACCTG
CTGAAGGCCT CGCTGGGCTA TGACCAGGGC ACGGGCTTCT TCGGCAATCT GGGCGTCAGC
TACACCAGCT CGCGCTGGTA CACCTTCGAG AACGACGGCG GGAAGGTCGA CGGCTTCACC
GTGGCCGACC TGACCGTGGG CTACCGCTTC GACGGCGCCG ACAACGCCTG GCTGCGCGGC
CTGGAAGTGC AGGGCAACAT CACCAACCTG ACCGACGAGG ACTACATCTC GACGGTGGGC
TCGGGCGGCG CGGCCAAGGC CGACCCGACC GGCCAGGCCA TGACCCTGCT GCCGGGCGCC
CCGCGCCAGG CCTATGTGAC GGTGCGCAAG CGGTTCTAG
 
Protein sequence
MKRHLMLGGA ACALLLATPM LAMAAEPAAA PAPADTAPAD ANEVDAVIVL GQGQSRQVQT 
ITTTQIEVLA AGSSPLRAIE KLPGVSFQSA DAFGAYEWST RVAIRGFDQS RLGFTLDGVP
LGDMSYGNYN GLHISRAVTS EDLGSVELAQ GAGSLSTAST SNLGGTLRFF TRDPSKDFHV
QTNATVGSDN MYRVFGRIDT GAIEGLGGLR AYLSAVDQKA DKWKGGSEQK QRQYDAKVVL
PLGETGSLTA FWNHSERREQ DYQDMSFDMI KRLGRDWDNF QPDWNKAVGV GAVLNNPANY
AGATPILNGG YWTGVGTNPY AQYGVATPDD AYYAGAGIRD DDIYAVTLKH DLGETARFDL
TGYGHKNQGQ GLWYTPYTVS PGYGTAGSTA APLSIRTTEY DIDRKGLIGN LYVDLGSHAL
SAGFWTEDND FHQARRFYAE TAAAPSRDPL DFQSNPFKTS WEFKFNTKTT TGYLQDVWTV
NDALKVNFGF KALKVENQVT SIVGSVNGKI QSKDSFLPQV GAVYSLNNGV ELFGGYTENM
AAYVSAATSG PFASQNQAAV NFIRDNLKPE TSKTFEGGVR YKTDRFQGVA AVYHVDFENR
LDSASTAPPI LGLPAVLSNV GAVKTKGIEL AGQYRLTDAW SLYGSYAYND STYKDDVPGA
GGTVAMRTKG KDVIYTPKNL LKASLGYDQG TGFFGNLGVS YTSSRWYTFE NDGGKVDGFT
VADLTVGYRF DGADNAWLRG LEVQGNITNL TDEDYISTVG SGGAAKADPT GQAMTLLPGA
PRQAYVTVRK RF