Gene Caul_4674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4674 
Symbol 
ID5902136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5052132 
End bp5054210 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content68% 
IMG OID641565193 
ProductTonB-dependent receptor 
Protein accessionYP_001686292 
Protein GI167648629 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.396726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAAAT CCTTCCGCCT GGGCCTGATG GCCGGCGCCA CGTTCTGCCT GCTGTCTTCC 
GCCGCCCATG GCGCTAACTC GGCCGCTGAC GCCGACCAGC GCGCCATCGT CGATTCGGTC
ATCGTCACCG CCCGCGCCAA TCCCGAGGAC CCGCCGGTCG TGGCGGCCGC TCGCCGCCGC
CTGTCGGAAA CGCCCGGCGG GGTGTCGGTG ATCTCCCAGG AATCCTACAT CGACCGCCAG
ACCCTGGCCC TGGACGACAT GCTGCGCGAC GCGCCGGGCG TCTACGCCCA GCGCAAGTGG
GGCGGGGATA TCCGCATCTC GATCCGCGGC TCGGGGATCG GCAACGCCAA CCACAACCGG
GGCCTGCTGA TCGCCCAGGA CGGGGTGCCG CTGAACGAGG CCGACGGCTT TGGCGACAGC
CAGGTCGCCG ACCCGCTGAA CACCCGCTAC GTCGAGGTCT ATCGCGGCGG CAACGCCCTG
CGCTTCGGCG GGGCCCTGCT GGGCGGGGCG ATCAACATGG TCACGCCCAC GGGCAAGGAC
GCCGGGTTCG ACAATCAGGT TCGCGTGGAC GGCGGCTCCT ACGGCCTGCT GCGCGAGCAC
GTCGCCATCG CCCGCCAGTC TGGAGATGGG GCCGGCGACT GGGACGTCTA TGCCGCCGCC
ACCAACCAGA CCGGCCAGGG CTGGCGCTCG CAGAGCCAGC AGAACATCCA GTTCGGCAGC
CTCAATATTG GGCGATCGTT CGGCGAGGGG CGCGAGGTCC GCTTCATCGT CAACGGCTCG
AACATCAATC AGGAGATCCC CGGTTCGCTG ACGCTGGACC AGTTCAACCG GAACCCGCGC
CAGACCGCCT TGGGCAACGC CAACGGCGAC CAGGGCCGAA ACCAGCGCGG CGTGCGCGGC
TCCTTGGGAA CGACCTGGCG GCTCGGCGAC AATCTAGTCT TCCAGGGCGC GGTCTATGCG
GTGTGGAAGG ACCTGGACCA CCCGATCTTC CAGGTCATCG ACCAGCAGAG CCGCAACTAC
GGCGCCTTCG GCCGCTTCGA CTGGGAGGGC CAGATCGGCG GCAAGCGGGC CGACGCCTTC
TTCGGCGCCT GGTACCGCAC CGGCGACATG GACTCCAACT TCTACGCCAA TGTCCGCGGC
GCCCGCGGCG CGCCCATGTC GCGGACCCTG CAGAACGCCA AGGCCATGGA TGTGTTCGGC
GAAGGGCGCC TGTTCGTCAC CGACCGCCTG GCCGTGGTGG CCGGCGCGAC CTGGGGCCAG
GCCGAGCGCG ACTACACCAG CTTCGCCGTT CCGGGCGTGG CGAGCACGTT CAACCTGAAG
GCCGACAAGA CCTACAACTG GGTGTCGCCG CGCATCGGCC TGCTCTGGCA GGGCGAGGCG
GGCGACCAGC TGTTCGCCAA CCTGACCCGC TCGGTCGAGC CGCCGAACTT CAGCTCGATG
ACGCCGACCA ACACCGGCTT CACGCCGGTC CGCGCCCAGA AGGCCTGGAC CGGCGAGTTC
GGGGCGCGCG GCCACAAGGG TCCGTTCACC TACGACGTCA CGCTGTACCG CGCCGATCTG
AAGGACGAGA TGCTGCAATA TGCGGTGAGT TCGTCCATCC CGGCCTCGAC CTTCAACGCC
GACAAGACGG TTCATCAAGG CATCGAGGCG GCGCTGGACT GGACCGTGGC CCCACATTGG
CGCCTCCGCC AGACCTGGAC GCTTTCGGAC TTCCGCTTCA AGGACGACGT CCAGTTCGGC
GACAATCGCC TGCCGATCGT GCCCCGGACC TTCTACCGCT CGGAAGTGCG CTACGACGAC
CCGCGCGGCT GGTTCGTGGC CCCGTCGGTC GAGTGGTCGG CGAGCGATCA GTGGATCGAC
TACGCCACCA CCAAAAAGGC GCCGGGCTAT GCGATCCTGA ACCTGAACGC CGGCTGGAAG
GTCAGCGACC AGGTTTCGCT GTTCCTCGAT GCGCGCAACC TGGCGGACAA GGCCTATGTG
TCCAACACCC AGGCCGCCGT CACCTGGACC CCGACAACCG CCACGCTGTG GCCGGGAGAC
GGCCGGTCGG TGTTCGGCGG CGTCACGGTC GATTTCTGA
 
Protein sequence
MSKSFRLGLM AGATFCLLSS AAHGANSAAD ADQRAIVDSV IVTARANPED PPVVAAARRR 
LSETPGGVSV ISQESYIDRQ TLALDDMLRD APGVYAQRKW GGDIRISIRG SGIGNANHNR
GLLIAQDGVP LNEADGFGDS QVADPLNTRY VEVYRGGNAL RFGGALLGGA INMVTPTGKD
AGFDNQVRVD GGSYGLLREH VAIARQSGDG AGDWDVYAAA TNQTGQGWRS QSQQNIQFGS
LNIGRSFGEG REVRFIVNGS NINQEIPGSL TLDQFNRNPR QTALGNANGD QGRNQRGVRG
SLGTTWRLGD NLVFQGAVYA VWKDLDHPIF QVIDQQSRNY GAFGRFDWEG QIGGKRADAF
FGAWYRTGDM DSNFYANVRG ARGAPMSRTL QNAKAMDVFG EGRLFVTDRL AVVAGATWGQ
AERDYTSFAV PGVASTFNLK ADKTYNWVSP RIGLLWQGEA GDQLFANLTR SVEPPNFSSM
TPTNTGFTPV RAQKAWTGEF GARGHKGPFT YDVTLYRADL KDEMLQYAVS SSIPASTFNA
DKTVHQGIEA ALDWTVAPHW RLRQTWTLSD FRFKDDVQFG DNRLPIVPRT FYRSEVRYDD
PRGWFVAPSV EWSASDQWID YATTKKAPGY AILNLNAGWK VSDQVSLFLD ARNLADKAYV
SNTQAAVTWT PTTATLWPGD GRSVFGGVTV DF