Gene Caul_3941 details


Gene Information

Locus tag: Caul_3941
Symbol:
ID: 5901403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4264170
End bp: 4266548
Gene Length: 2379 bp
Protein Length: 792 aa
Translation table: 11
GC content: 65%
IMG OID: 641564462
Product: TonB-dependent receptor
Protein accession: YP_001685564
Protein GI: 167647901
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
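
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4264170
END_BP = 4266548
GENE_LENGTH_BP = 2379
PROTEIN_LENGTH_AA = 792


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record: 4266548 - 4264170 + 1 = 2379 and 2379 / 3 - 1 = 792.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

If either assumption does not hold for another record (for example a partial CDS without a stop codon), the second check would need adjusting.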


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.78272
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGTGTCGA CAAAGCGTAC CGGGTTTTCC AAGGCCGGAA CGGCGGTCGC GGGACTGCGC 
TGGACCACGG CCCTGCTGGC CTCGTCCATG CTGTCCAGCG TGGCGGCCCA CGCCCAGACC
ACCGCGCAGA GCGGCGCCGT CGCCCTCGAG GAAGTGGTCG TGACCGCCCA GAAGCGGTCG
GAAAACCTGC AGGACGTGCC GATCAGCATC CAGGCCCTGG GGACCCAGAC CCTGGAAGAG
CGCCACGTCA GCGGCCTCAA CGACTATGTG AAGTTCCTGC CCAGCGTGTC GATCCAGACC
ACCTCGCCGG GCTTTTCCAG CGTGTTCATG CGCGGCGTGG TCAGCGGCGA CAACGCCAAC
CACTCGGGTC CGCTGCCCAC CGTCGGCACC TATCTGGACG AGCAGCCGAT CACCACGATC
CAAGGCGCGC TCGACGTCCA CATCTACGAC ATCGCCCGGG TCGAGGCGCT GTCGGGGCCG
CAGGGCACGC TGTACGGCGC CAGCTCGATG GCCGGCACGG TGCGCATCAT CACCAACAAG
CCCAGCACCG CGGGCTTCAA GGCCGGCTAC GACATCGAGG CCAACACCGT CTCGAAGGGC
GATCCCGGCT ATGTGGTCGA GGGCTTCGTC AACCAGCCGA TCAACGACAA GGTGGCCGTG
CGCCTAGTCG GCTGGGCCGA GCATGACGGC GGCTATATCG ACAACGTGGC GCAGTCGCGC
ACCTTCCCGA CCTCGGGCGT GACCCGCAAC AACGCCAGCC AGGTCAAGAA GAACTACAAC
GACGTCGACA CCTACGGCGC GCGGGCGGCC CTGCGCCTCG AGCTCGGCGA CAACTGGGTG
ATCACCCCCA CCGTCATGGG CCAGCAAGAG AAGGTGCACG GCCTGTTCGC CTTCGACCCC
ACCCTCGGCG ACCTGAAGGT GGCGCACTAC TATCCCGAAG GCTCCAACGA CAAATGGGGC
CAGGCGGCCC TGACCATCGA GGGCCAGATC AGCAACTTCG ACGTGGTCTA TGCCGGCTCG
TACCTGAAGC GCACTGTCGA CAGCACGTCG GACTACAATG ACTATTCGTT CTTCTACGAC
ACCCTGACCA CCTACGGGCA GTACTATACC GACAACGCCG GCAACCCGGT GGATCCGTCG
CAATACATTC AGGCCCGGGA CGGCTACACC AAGCAGAGCC ACGAGATCCG CATATCGAGC
CCGCAGGAGA ACCGCGTCCG CTTCGTCGGC GGCCTGTTCT ACCAGCAGCA GAAGCACGAC
ATCTTCCAGA ACTACAAGAT CGACGCCCTG GGGACGACCT ATTCGGTGAC GGGCTATCCC
CACACCGTGT GGCTGACCAA GCAGAAGCGC GTCGACGAGG ACCAGGCGGC GTTCGGCGAG
GTGGCGTTCG ACATCACCGA CAAGCTGACC CTGACCGGCG GCGTCCGCTT CTTCAGGGCG
CACAACAGCC TGAAGGGCTT CTTCGGCTTC GGGATCGATT TCCCGTTCAG CTCCGGCGAG
AAGCAGTGCT TCGGCCCGCC GGTCGTCGAC GGTTCGCCCT GTACGAACCT GAACAAGAGC
ATCAAGGAGA ACGGCAACTC GCCCAAGGTC AACCTGACCT ACAAGATCGA CTCCGATAAG
CTGGTCTACG CCACCTATTC CGAGGGCTTC CGGCCGGGCG GCATCAACCG CCGCGGCACG
GTTCCGCCCT ACACCGCCGA CTTCCTCAAG AACTACGAGG CGGGGTGGAA GACCTCCTGG
CTCGAGAACC GGGTGCGCTG GAACGGCGCG GCCTATATCG AGGACTGGAA GGACTTCCAG
GTCGGCCTGC TGGGCGCCAA CAGCCTGACC GAGGTGCACA ACGCGGGCAG CGCGCGGGTC
AAGGGCGTCG AGACCGACAT CAATTGGGCG GTCATGCGAG GTTGGACGGT CTCCGGCTCG
GCCGCCTATA CCAGCGCCCA CCTGACGGAG AACTTCTGCC TGGCGCTGGT CAACGGCCAG
CAGGTCACCA ACTGTCCCAA TCCCGAGGCG CCCAAGGGCT CGGTCCTGCC GATCACTCCG
AAGTTCAAGG CCAACCTGAC CAGCCGCTAC GAATGGGACA TGGGTGAGTA CCGCGCCCAC
GTCCAGGCGG CCGGGGTCTA TCAGACCGGC AGCTGGACCG ACCTGCGGAT CATCGAGCGC
GGGTGGATCG GCCGGCAGAA GGCCTATGGC GTGCTGGACC TGACCACCGG CGTCGATCGC
GACAACTGGA GCCTGGAGCT GTTCGTCAAG AACGCCTTCG ACAAGCGCGC TAGCCTCTAT
CGCTACGCCG AGTGCAACGA GTCGATCTGC GCCGCCCAAG TCTATCAGGT GCCCAACCAG
CCGCGGTTCA TGGGCGTCAA GTTCGGCCAG AAGTTCTAG
 
Protein sequence
MVSTKRTGFS KAGTAVAGLR WTTALLASSM LSSVAAHAQT TAQSGAVALE EVVVTAQKRS 
ENLQDVPISI QALGTQTLEE RHVSGLNDYV KFLPSVSIQT TSPGFSSVFM RGVVSGDNAN
HSGPLPTVGT YLDEQPITTI QGALDVHIYD IARVEALSGP QGTLYGASSM AGTVRIITNK
PSTAGFKAGY DIEANTVSKG DPGYVVEGFV NQPINDKVAV RLVGWAEHDG GYIDNVAQSR
TFPTSGVTRN NASQVKKNYN DVDTYGARAA LRLELGDNWV ITPTVMGQQE KVHGLFAFDP
TLGDLKVAHY YPEGSNDKWG QAALTIEGQI SNFDVVYAGS YLKRTVDSTS DYNDYSFFYD
TLTTYGQYYT DNAGNPVDPS QYIQARDGYT KQSHEIRISS PQENRVRFVG GLFYQQQKHD
IFQNYKIDAL GTTYSVTGYP HTVWLTKQKR VDEDQAAFGE VAFDITDKLT LTGGVRFFRA
HNSLKGFFGF GIDFPFSSGE KQCFGPPVVD GSPCTNLNKS IKENGNSPKV NLTYKIDSDK
LVYATYSEGF RPGGINRRGT VPPYTADFLK NYEAGWKTSW LENRVRWNGA AYIEDWKDFQ
VGLLGANSLT EVHNAGSARV KGVETDINWA VMRGWTVSGS AAYTSAHLTE NFCLALVNGQ
QVTNCPNPEA PKGSVLPITP KFKANLTSRY EWDMGEYRAH VQAAGVYQTG SWTDLRIIER
GWIGRQKAYG VLDLTTGVDR DNWSLELFVK NAFDKRASLY RYAECNESIC AAQVYQVPNQ
PRFMGVKFGQ KF
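
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that and to run a few basic checks on a nucleotide sequence. The helper names are hypothetical, and only the first and last lines of the gene sequence are used as an excerpt; paste the full sequence to compute a GC fraction comparable to the GC content field.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Length is a multiple of 3, starts with ATG, and ends with a stop codon.

    Simplification: only ATG is accepted as a start codon here, although
    table 11 also permits alternative starts such as GTG and TTG.
    """
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Excerpt only: the first and last lines of the gene sequence in this record.
first_line = clean_sequence(
    "ATGGTGTCGA CAAAGCGTAC CGGGTTTTCC AAGGCCGGAA CGGCGGTCGC GGGACTGCGC"
)
last_line = clean_sequence("CCGCGGTTCA TGGGCGTCAA GTTCGGCCAG AAGTTCTAG")

print(len(first_line), first_line[:3])  # 60 ATG
print(len(last_line), last_line[-3:])   # 39 TAG
```

With the full gene sequence pasted in, `len(clean_sequence(...))` should match the Gene Length field, and `looks_like_complete_cds` gives a quick sanity check before translation.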