Gene Caul_2509 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2509 
Symbol 
ID5899964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2723571 
End bp2725982 
Gene Length2412 bp 
Protein Length803 aa 
Translation table11 
GC content67% 
IMG OID641563000 
ProductTonB-dependent receptor 
Protein accessionYP_001684134 
Protein GI167646471 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4771] Outer membrane receptor for ferrienterochelin and colicins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0337373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTTTA AAAGCTTCCT TCTGGCCGCC ACATCCGTCG CCGCCTTCGC CGGCGCGGCC 
CATGCCCAGG ACGCCGCGAC GCCGGCGGAC GATTCGACCG CCGTTGACCA GATCGTCGTC
ACAGGCACTC GGGTCGCCCG CTCGCGGCTC GACACCGTCT CACCGGTCGA CGTCGTCGAC
AACAAGGCGC TGACCCGGCA AGGGACGCCA GAACTCGCCC AGGCCCTGGC CAACCTGGCG
CCGTCGATCG ATTTCCCCCG CCCCGCCGTG ACCGACGGCA CCGATTCGGT GCGTCCGGCC
ACCCTGCGCG GCCTGTCGCC CGATCAGACC CTGGTCCTGC TGAACGGCCA CCGCGCCCAC
ACCGCGGCCT TGGTCAACAT CAACGGCTCG ATCGGCCGCG GCTCGGCCCC ATTCGACCTC
AACACCATCC CAGTCGCCGC GCTCGACCGG GTCGAGATCC TGCGCGAGGG CGCGGCCGCC
CAATACGGTT CGGACGCCAT CGCCGGGGTG ATCAACCTGC GTCTGCGCGA AGCCAGCCAC
GGCGGCGGCG CCAGCGCCAC CTACGGGATC TATGACACCA AGGTGCAGAC GTCCCGCGAC
ACCGACGGCC GCAAGGCCCA CGACGGCCCG ACCTATAGCG CTTCGCTGTG GCAAGGTTTC
GCCCTGCCCA ACGAGGGCTT CCTGACGGTC ACCGGCGAAT ATTCGTTCCG CAATCCGACC
AACCGCGCCG ACCGCGATCC GCGCGTCACT CCGAGCAAGG TGACGGGGGT CTATGGCGAC
CCGCAGGTCG AGACCAAGAC GATCTACGCC AATTTCGGCC TGCCGCTGAA CGACGCCTGG
AGCCTGTACG GCCTGGCCGG CTATCAGAAC CGCAAGGGCG AGAGCTCGGC GTTCCCGCGT
CTGGCCGACA ACGCCAACAA CTACACCAGC GTCTATCCCA CCGGCTACGT GCCGCGGATC
ACCACCAAGA TCGACGACTA CAACCTGGCT TTCGGGACCA AGGGCGTGAT CGGCGGCTTC
AACATCGACG CCAGTGTCAC CTACGGCAAC AACAAGGTCG AATACGGCAC CATCAATTCG
CTCAACGCCT CGCTTGGCTC GACGTCGCCG ACCCGCTTCA AGGACGGCTC GATGCAGTAC
GGCCAGACGG TGGCGGCGAT CGACGTCAAC CGGCCGCTGG AACTGTTCAG CTTCGCCAAG
CCCAGCACCC TGGCGTTCGG GGTCGAGTAC CGCGACGAGA CCTACAAGAT CGAGGCCGGC
GAAACCGCCT CGTGGATCAA TGGCGGCAAC GGCAAGGGCG CTGGCGCCCA GGGCTTCCCG
GGCTTCCGCC CAGCCAACGC GGTCGATGTG AGCCGTAACG CCACCAGCCT CTATGTCGAC
CTGGACAACC AGATCACCGA CAAGCTCGAT ATCGACCTGG CCGCCCGCTA CGAGGACTAT
TCGGACTTCG GCTCGACCAC GACCGGCAAG GTCGCGGCTC GCTACGACTT CACCGACAAC
TTCGCCCTGC GCGGGGCCTT CTCGTCGGGC TTCCGGGCTC CGGCTCTGCA GCAGCAGTAC
TTCACCACCA CCTCGATCAA CATCGTCGCC GGCGGCGCGG CGGTCGACGT CGGCACCTTC
CCGGCCACCA GCGCCACCGC CGCCGCCCTC GGCGCCAAGC CGCTGGAGCC GGAAAAGTCG
AAGAACTATT CGTTGGGCGC GATCTACCAC AAGGGGCCGT TCGAGCTGAC GGTCGACGCC
TACCAGATCG ACATCGACAA CCGCATCGTG CTGTCGGAAA ACATCGCCGG CTCGGCCACG
GGCACACCGA CCCAGCGGGC GATCTTCAAC CTGCTGCAGC CATTTGGCGT GACCACGGCG
CGGTTCTTCG TCAACGGCGT CGACACCACC ACCAAGGGCG TCGATGTGGT CGCCCGCTAC
CGGCTCGACG CCGACGCGGC GGGCCGCTTC GACTTTACCC TGGCGGCCAA CTACAACCAG
ACCGACGTCA ACAAGGTGCC GACCACCAGC ACCCTGTCGG GCCTGCCGGT CCCGCCGCCG
CTGTTCGCCC GCGTCAACGT CCTGACCTAT GAGCAAGGCA CGCCCGACCA CAAGTTCGTG
GCCTCGGGCG ACTGGAGCAA CGGCCCGTGG GGCGCCACCC TCAAGGGCAC GGCCTACGGC
TCGGTGCTGG TGCCCAACGC CACCCCGGCG CTGGATTATA AGTCGGGCGC CAAGACCGTC
TGGGACGCCG AGGCCCGCTA CACCTTCCTC AAGGACATCA CCTGGGCCCT GGGGGTCAAC
AACCTGTTCG ACGAGTACCC CGATCGCGCC CCTGCCCCGG TCAACACGAC CGGTGTGGTC
GCCTTCCCCA GCTATTCGCC GTTCGGCTTC AATGGGCGGT TCCTCTACAC GCGGCTAAGC
TACAGCTGGT AA
 
Protein sequence
MHFKSFLLAA TSVAAFAGAA HAQDAATPAD DSTAVDQIVV TGTRVARSRL DTVSPVDVVD 
NKALTRQGTP ELAQALANLA PSIDFPRPAV TDGTDSVRPA TLRGLSPDQT LVLLNGHRAH
TAALVNINGS IGRGSAPFDL NTIPVAALDR VEILREGAAA QYGSDAIAGV INLRLREASH
GGGASATYGI YDTKVQTSRD TDGRKAHDGP TYSASLWQGF ALPNEGFLTV TGEYSFRNPT
NRADRDPRVT PSKVTGVYGD PQVETKTIYA NFGLPLNDAW SLYGLAGYQN RKGESSAFPR
LADNANNYTS VYPTGYVPRI TTKIDDYNLA FGTKGVIGGF NIDASVTYGN NKVEYGTINS
LNASLGSTSP TRFKDGSMQY GQTVAAIDVN RPLELFSFAK PSTLAFGVEY RDETYKIEAG
ETASWINGGN GKGAGAQGFP GFRPANAVDV SRNATSLYVD LDNQITDKLD IDLAARYEDY
SDFGSTTTGK VAARYDFTDN FALRGAFSSG FRAPALQQQY FTTTSINIVA GGAAVDVGTF
PATSATAAAL GAKPLEPEKS KNYSLGAIYH KGPFELTVDA YQIDIDNRIV LSENIAGSAT
GTPTQRAIFN LLQPFGVTTA RFFVNGVDTT TKGVDVVARY RLDADAAGRF DFTLAANYNQ
TDVNKVPTTS TLSGLPVPPP LFARVNVLTY EQGTPDHKFV ASGDWSNGPW GATLKGTAYG
SVLVPNATPA LDYKSGAKTV WDAEARYTFL KDITWALGVN NLFDEYPDRA PAPVNTTGVV
AFPSYSPFGF NGRFLYTRLS YSW