Gene Caul_4007 details

Gene Information

Locus tag: Caul_4007
Symbol:
ID: 5901469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4337537
End bp: 4338706
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 67%
IMG OID: 641564528
Product: TonB-dependent receptor
Protein accession: YP_001685630
Protein GI: 167647967
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
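
The length fields above follow from the coordinates by simple arithmetic. The sketch below is illustrative only and assumes 1-based inclusive coordinates and an unspliced CDS that ends in a stop codon. The function names (gene_length, protein_length, gc_percent) are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq (spaces ignored)."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


length = gene_length(4337537, 4338706)
print(length, protein_length(length))  # prints: 1170 389
```

Passing the gene sequence from the Sequence section to gc_percent would give a value to compare against the reported GC content.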


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.413023
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGGGCG GCTTCTACCT GCACCGCGAC ACCGACCTCA ACGGCGAGGA CCGGTCCAGC 
TCGGCCTTCC TCACCACCCG CCGCCTCACC GGTCTGCCGG GCGCGACCTT CGCCAAGTTC
GGCGCCGACA CGCGCACCTA CGAACTGGCC GGCTTCGGTG AGCTCACCTA CCACCTGACC
GACAAGCTGT CGGCCACCGG CGGTCTGCGC TACGGCAAGT ACGGCGGCAC GGTGGACACC
TATGCCGGCT TCAACACGGC GTATTTCACC TACGCGCTGC TCGGCTTTTC CGGGCCGCTG
GCGCTCACCC CGTCTCCGGC CTCGACCACG AAATACCCCT CGGCCGAAAA GGCGTCGTGG
AAAGCCAGCC TGACCTACAA GCCGTCGCGC GACCTGACGA CCTACGCCAC CGTCTCCACC
GGCTACCGGA CGCCCGTCTA CAACGGGCGC GCCGGCAGCG TCAGCACGGT CAATCCGAGC
GACCTGGTCA TTCCGGCCGG CGCGGGCTCG GACAATCTCA TCAACTACGA GGTCGGCCTG
AAGGGGCGCT GGCTGGACGG GAAGCTGAAC GCGAACCTGG CGGCCTATTA TATCGACTGG
AAGAACATCC AGGTTCAGGC CAACCGCCAG TCGGATTCGA TCCAGTTCGC CACCAATGTC
GGGCGCGCCG CCAGCAAGGG GCTGGAGGCG GAAGTCACGC TCGCGCCGGT CCGCGGGCTG
GTGTTGGGGC TGAACGGCTC GCTCAACGAC GCCAAGGTGA CCGAGCTCTC TCAGCAGGAG
GCCGTGATCT CCGGGGCGGT GGATGGCGCG AGGCTGGCGT CGCCGCACGT GCAGGGGGCG
TTGTTCGGCA CGTACAGCTA CGCCGTGGGC GACGGGGCGA CGGGCTTCAC CAGCGTCCAG
ATCCAGCACG TTGGTTCATT CCCCAACGGC TTCCCCAACA AGCCGGGCAC GCCGGGAACG
CTTAGCCCGC TGTACGGACA CACCGACAGC TACACCTACG TCAACCTGCA GACGGGCCTG
ACGTTCGGCA AGCTGAGCAC GACCCTCTAC GCCGAGAACC TCGGCAACAG CCGGGCGACG
GTCTACATTC ACCCCGAAGC CTTCGTTTAC AGTCGCAACG CGATCGTCCG GCCGCGCACG
TTCGGCGTCC GGGTGGGCTA CGACTTTTGA
 
Protein sequence
MAGGFYLHRD TDLNGEDRSS SAFLTTRRLT GLPGATFAKF GADTRTYELA GFGELTYHLT 
DKLSATGGLR YGKYGGTVDT YAGFNTAYFT YALLGFSGPL ALTPSPASTT KYPSAEKASW
KASLTYKPSR DLTTYATVST GYRTPVYNGR AGSVSTVNPS DLVIPAGAGS DNLINYEVGL
KGRWLDGKLN ANLAAYYIDW KNIQVQANRQ SDSIQFATNV GRAASKGLEA EVTLAPVRGL
VLGLNGSLND AKVTELSQQE AVISGAVDGA RLASPHVQGA LFGTYSYAVG DGATGFTSVQ
IQHVGSFPNG FPNKPGTPGT LSPLYGHTDS YTYVNLQTGL TFGKLSTTLY AENLGNSRAT
VYIHPEAFVY SRNAIVRPRT FGVRVGYDF
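
The protein sequence corresponds to a translation of the gene sequence with translation table 11, in which the initial GTG is read as methionine. A minimal sketch of that translation follows. It assumes the codon-to-amino-acid assignments of table 11 match the standard code and treats ATG, GTG and TTG as initiators (a simplification of the full list of table 11 start codons). The name translate_cds is a hypothetical helper.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a simplified set of initiator codons, each read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS given as text (whitespace allowed) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGCGGGCG GCTTCTACCT GCACCGCGAC"))  # prints: MAGGFYLHRD
```

Applied to the full gene sequence, the same function can be used to check the result against the listed protein sequence.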