Gene Caul_3922 details


Gene Information

Locus tag: Caul_3922
Symbol:
ID: 5901384
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4240152
End bp: 4242998
Gene Length: 2847 bp
Protein Length: 948 aa
Translation table: 11
GC content: 66%
IMG OID: 641564443
Product: TonB-dependent receptor
Protein accession: YP_001685545
Protein GI: 167647882
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4771] Outer membrane receptor for ferrienterochelin and colicins
TIGRFAM ID:
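
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record for Caul_3922.
length_bp = gene_length_bp(4240152, 4242998)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (2847 bp) and Protein Length (948 aa) fields.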


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.719329
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0833572
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATTGC AGATCACCCG CGGACGCCTT CTGGCGACGA CGATGATGGC CGGGGTCGCC 
ACCTTGGCCG CGTTCACCGC CAACGCCCAG ACCGCCCCGG CGCCCGCCCC TGCTCAGGCG
GATAACGAAG TCGAAGCCGT CGTCGTCACC GGTTCGCTGC TGCGTCGCAC CGACACCGCC
ACCCCGTCGC CGGTCACGGT TCAGACCACC GAGCAGCTGA AGGCCCAGGG CATCACCACG
ATCGCCGACG CCATCCGCAG CCTGTCGGCC GACAACTCCG GTTCGATCCC CGCCGCCTTC
GGCAACGGCT TCGCGGCCGG TTCGACCGGC GTGTCGCTGC GCGGCCTGTC GGTCAACTCG
ACCCTGGTGA TGATCGACGG CCTGCGCAAC GCCAACTACC CGCTGGCCGA CGACGGCCAG
AAGGCGTTCG TCGACCTCAA CTCGATCCCG TTCAACGCGG TCGAGCGGAT CGAGACCCTC
AAGGACGGCG CCTCGTCGCT GTACGGCGCC GACGCCATCG GCGGCGTGGT CAACATCATC
ATGAAGTCGA ACTACCAGGG CATGGGCGCC GACATCTCTT ACGGGTCCAG CCAGCACGGC
GGCGGCGACC AGTACCGTTT CACCGGCGAC ATCGGCTACG GCGACCTGGA TACCGACAAG
TACAACCTGT ATTTCGACGT CGAGTACCAG CTGGACAAGG CCATCCGCGG CGACCAGCGC
GGCTTCCCGT ACAACACCAA CGACCTGACC TCGATCCCCG GCGGCCAGAA CAACAACCCG
CAGACCGGCG GCAGCACCTT CTACGGCAAC GTCCTGCGGG CCAGCCTGGG CACGCCCGGC
AACCTGGCCA CCGGCGTGGG CATCGACGGC GCCCTGTGGC AGCCGCTGCG GACTTGCGCC
GCCAACGCCC CGCTGACCAC GCAGAAGGAT GAAGACGGCA ACGTGATGGG CGCCTACTGC
GCCCAGAACG TCGCCAATTT CGGCGACATC CAGCCAAAGC AGTCGCGCGC CGCGGTCTAT
GGCCGCTTCA CGATCAAGCC GACCGACCAT CTGGAAGCCT ATCTGAGCGG CAGCTTCGTG
CAGAGCAAGA CCTCGGTCCG CGGCACCCCC GTGACCGTCT CGTCGAGCAC GCCCAACAAC
CTGACCAACC TGGTCCTGCC GGTGTGGATC TGCGACACGG GCGTCAACTG CGCCACCGAC
ACCACTGCGG TGAACCGTCG CCTGAACCCG AACAATCCGT TCGCGGCCGA CGGCGACTAC
GCCCTGATCA AGTACCGCTT CAGCGACCTG CCGGCCTCGG CCCGCTACAA CAACCGCATG
CTGCGCATGG TCGGCGGCGT GAAGGGCGAT GTGGCGGATT GGAACTACCA GGCTAACCTG
GTGATCGCCC ATGACTCGCT GGAAAGCGTG CAGACCGGCT TCCTCAGCTA CAAGCAGCTG
ATGAGCGACA TCCAGACGGG CGCCTACAAC TTCGTCGATC CGTCGAAGAA CAGCGCGGCT
GTTCTCAAGG CCCTGTCGCC GGATCTGGCC AAGACCTCGA CCACCGACCT GGACTCGATC
AACGTCCAGG CCTCGCGTCC GGTCTTCAGC CTGCCGGGCG GCGACGCTCA GTTGGGCCTG
GGCGGCGAGT TCCGCTACGA AGCGACCAAC GACCCGGCCC TGAACCCGGT CAACGACGCC
CAGGGCCTGG GCAACGCCCG CACCGAGGGC AACCGCACCG TCGCCTCGGC CTTCGCCGAG
CTGGGCCTGC CGATCACCGA CAAGATCGAG GTCAACGTCT CGGGCCGCTA CGACCACTAT
TCGGACTTCG GCGGCAACTT CTCGCCGAAG ATCGGCGTGA AGTTCACGCC CATCAAGACG
GTCGCCCTGC GCGGCACCAT CTCAAAGGGC TTCCGGGCGC CGAGCTTCTC CGAAGCGGGC
AACTCGGCCA GCCAGGGCTT CACGACGTTC TCGTTCAAGT CGGCCAAGTA CGCCGATTTC
CGCGCCTCGC ACGGCAACAA CGAGTACACC AAGGACTACT CGCTCTCGTC GATCACCACG
GCCAATCCGG ACCTGGATCC GGAAAAGTCG ACCAGCTACA CCCTCGGCGC GGTTTGGGCC
CCGACCCGCA GCTTCAGCGT GTCGCTGGAC TACTACCACA TCAAGAAGAC CGACGTGATC
GCGCAAGCCA GCGCCGGCGT GGCCCTGGCC GCCTACTACG CGGGCCAAGC CATCCCGGCT
GGCTACGTCG TCACAGCCGA CGCGGCGGAC CCGCAGCATC CCGACGCCCT GGCGCGCGTG
CTGTCGGTGG CCTCGCCCTA CATCAACGCC GACTCGCTGG TGACCAGCGG TCTGGACCTG
AACGTCCAGG CCACGTTCTA CCTGCCGGCC GACGTGAAGT GGACCAGCAA CCTCGACGCC
ACGACCCTGT TCGACTTCAA GTACACGCAG GACGGCACGA CCTATAACTA CGTCGGCAAG
GAAGCGCCGT ACGTGCTGTC GTCGGGCGCC GGCACCCCGA AGAACCGCCT GTCGTGGACC
AACACCTTCG AGCATGGCCC GATCCACGTG ACCGGCGTCA TGAGCTATGT CAGCGGCATG
CGTGAAGTCG ACGACAGCTA CGACAGCGCC TGCCTCTATG GCGATGCGGC CTTCAACTGC
CGCGTCAAAT CGTTCACCAC GGTCGACCTG ACCGGCGCCT ACGACGTGAC CCAAAAGGCC
ACCGTCTACG CCGACGTCAT GAACCTGTTC GACACCGGCC CGATGTTCAA CCCGGCCAAC
TACGCCGGCG TGAACTGGAA CCCGACCTAC TCGCAGTCGG GCATCGTGGG CCGCTACTTC
CGCGTCGGGG TGCGCGTGAA GTACTAG
 
Protein sequence
MRLQITRGRL LATTMMAGVA TLAAFTANAQ TAPAPAPAQA DNEVEAVVVT GSLLRRTDTA 
TPSPVTVQTT EQLKAQGITT IADAIRSLSA DNSGSIPAAF GNGFAAGSTG VSLRGLSVNS
TLVMIDGLRN ANYPLADDGQ KAFVDLNSIP FNAVERIETL KDGASSLYGA DAIGGVVNII
MKSNYQGMGA DISYGSSQHG GGDQYRFTGD IGYGDLDTDK YNLYFDVEYQ LDKAIRGDQR
GFPYNTNDLT SIPGGQNNNP QTGGSTFYGN VLRASLGTPG NLATGVGIDG ALWQPLRTCA
ANAPLTTQKD EDGNVMGAYC AQNVANFGDI QPKQSRAAVY GRFTIKPTDH LEAYLSGSFV
QSKTSVRGTP VTVSSSTPNN LTNLVLPVWI CDTGVNCATD TTAVNRRLNP NNPFAADGDY
ALIKYRFSDL PASARYNNRM LRMVGGVKGD VADWNYQANL VIAHDSLESV QTGFLSYKQL
MSDIQTGAYN FVDPSKNSAA VLKALSPDLA KTSTTDLDSI NVQASRPVFS LPGGDAQLGL
GGEFRYEATN DPALNPVNDA QGLGNARTEG NRTVASAFAE LGLPITDKIE VNVSGRYDHY
SDFGGNFSPK IGVKFTPIKT VALRGTISKG FRAPSFSEAG NSASQGFTTF SFKSAKYADF
RASHGNNEYT KDYSLSSITT ANPDLDPEKS TSYTLGAVWA PTRSFSVSLD YYHIKKTDVI
AQASAGVALA AYYAGQAIPA GYVVTADAAD PQHPDALARV LSVASPYINA DSLVTSGLDL
NVQATFYLPA DVKWTSNLDA TTLFDFKYTQ DGTTYNYVGK EAPYVLSSGA GTPKNRLSWT
NTFEHGPIHV TGVMSYVSGM REVDDSYDSA CLYGDAAFNC RVKSFTTVDL TGAYDVTQKA
TVYADVMNLF DTGPMFNPAN YAGVNWNPTY SQSGIVGRYF RVGVRVKY
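
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch, not the database's own method: it builds the codon table from the standard NCBI base ordering (translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted), and the helper name `translate` is hypothetical. The two inputs are the first and last rows of the gene sequence listed above.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


first_row = "ATGAGATTGC AGATCACCCG CGGACGCCTT"  # first 30 nt of the gene
last_row = "CGCGTCGGGG TGCGCGTGAA GTACTAG"      # last 27 nt of the gene

print(translate(first_row))  # MRLQITRGRL
print(translate(last_row))   # RVGVRVKY*
```

The outputs match the first ten and last eight residues of the protein sequence, with the trailing `*` standing for the TAG stop codon.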