Gene Caul_4694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4694 
Symbol 
ID5902156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5076101 
End bp5078158 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content69% 
IMG OID641565213 
ProductTonB-dependent receptor 
Protein accessionYP_001686312 
Protein GI167648649 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGTC GCGTTCTGCT CTCCGCCGCG AGCCTGTTGG CGTTCGCCCT TGTCGCGCCC 
CTTGCTCAAG CCGCCGAAAC CCCAGCGCCG GCCGACGCCG CCGCGCCGGA CGGCCCCGAC
GGCCCCACCG CCGTCGACAA GGTTGTGGTC ACCGCCGCCC CCTACGCCGT CTCGCTCGAC
ACCGTGACCA GCAGTGTCAA CGTCGTGACG CGCGATCAGC TCGACGTCGC CGCGCCGGCC
GGCATTGGCG ACATGCTCAA CGGCCTGCCG GGCTTGCGTT CGACCTTCTA CGGTCCCGGC
GCCTCGCGGC CGGTGATCCG GGGCCTGTCG GGACCGCGGG TGATGGTGCT GCAGAACGGC
GTCGGCCAGG TGGACGCCAG CGCCCTGTCG CCTGACCACG CCGTGGCCAG CGACCCGGGC
GAGGCCTCGC GCATCGAGGT GCTGCGCGGT CCCTCGACCC TGGCCTATGG CGGCTCGGGA
ATCGGCGGGG TCGTTAACAT GATCGACGAC CGGGTGCCAT CGACGCCGGC GGCAAACGGT
CCCGAGGGCC GGTTGTCGGC TTCAGCCTCA TCGGTGGACA AGGGCTACGC CTATAGCGGC
GCGCTGAAGG CCGGCTCCGG CCCCATTGTC TTCGCGCTCG ATTTCTCGAG CCGCCGCACC
GACGATTACG ACGTGCCGGT CGCGCCGGTT TCCGACCGCC TGGCGGCCCG GGACGGCCTC
ACCGTCGATC CGGACAAGAC GGTCAAGAAC ACCGATGTCG AGGTCGACGC CTACGGCGCT
GGCGTCTCCT GGGTTCACGA CCGCGGCTTC GTCGGGGCGT CGGTCAAGAA AATGGACACC
ACCTACGGGG TTCCCTACGA GCAGATCCTG GCGCCGATCG ACCCCAACGC CGAGGGGCCG
GTCTCGATCC ACCTGCAGCA GACCCGCTAC GACGTGCGCG GTGAACAGGC GCTGGACACG
CCTTGGTTCG AGAAGGTTCG CGTCTCGCTG GGCTATGCCG ACTACGAGCA CGCCGAGGTC
AGTGTCGAGG ACGGCCAGGT CGGCACCCGG TTCCTGTCGC ACGGGACCGA GGGGCGGGTG
GAACTGGTTC ACCGCGAGCA CGACGGCCAC CAAGGCGCCA TCGGCTTCCA GGCTCTGGAC
CGCCACTTCT CGGCGATCGG CGACGAGGCC TTCGTGCCCT CGACCGACAT CAAGGAGTAC
GGAATCTTCA CCCTGCAACG CCTGGACCGC GGGACCTGGG GGATCGACGC CGGCCTGCGC
TTCGACACCC GATCCTTGCA GACCCCGACC GAGAAGCGCG ACTTCGACAA TGTTTCCGGT
TCGATCGGCG TGTTCCTCAA GCCGAGCGAC AGCCTGTTCT ACGCCCTGAC CCTGTCGCGT
AACGGTCGCG CCCCGACCGA GTTCGAACTG TTCGCCAATG GCCCTCATCC GGGCACCGGC
GGCTTCGAAG TCGGTGACAA CAAGCTCGAC AACGAGACGG TCACCTCGCT GGAGGCCACG
GTGCGCTGGA AGAGCGACCG CCTGCGCGCC GAGGGCCACC TGTGGGCCGC CAAGTACGGC
AGCTTCATCG AGGAGGCCCC GACTGGCGCG GTCGAGGACA ACCTGCCCGT CTACCAGTAC
TTCCAGACCA AGGCCGACTT CCACGGCGCC GAACTGGAGG CCAGCTACGA CGCCTGGCGC
GGGGCGACCC AGTCGCTGCG CCTGGAGACC ACGTTCGATT GGGTGCATGG CGACACCGAC
GCCGGCGTGC CCGCCCGCAT CCCGCCGTGG TCGCTGGGCG GTTCGGTGGT CTGGAACGTT
CCGCGCGTCG AGACCACGCT GGAGGTGCGC CGCGTGGCCG GACAGGATCG TGTCGCGCAG
TTCGAACTGC CCACCGACGG CTATACGGTG GTCAACCTCA AGGCCACGTT CAAACCATCC
GAAACCTCGC CCCTGCGGCT GTTCATCGAC GGCCGCAACC TGACCAACGC GGAGATCCGC
GAGCACGCCT CGTTCCTCAA GGACATCGCC CCGTCGCCGG GGCGTTCGGT GCGGGCGGGG
GTGGCCTGGA AGTTCTGA
 
Protein sequence
MPRRVLLSAA SLLAFALVAP LAQAAETPAP ADAAAPDGPD GPTAVDKVVV TAAPYAVSLD 
TVTSSVNVVT RDQLDVAAPA GIGDMLNGLP GLRSTFYGPG ASRPVIRGLS GPRVMVLQNG
VGQVDASALS PDHAVASDPG EASRIEVLRG PSTLAYGGSG IGGVVNMIDD RVPSTPAANG
PEGRLSASAS SVDKGYAYSG ALKAGSGPIV FALDFSSRRT DDYDVPVAPV SDRLAARDGL
TVDPDKTVKN TDVEVDAYGA GVSWVHDRGF VGASVKKMDT TYGVPYEQIL APIDPNAEGP
VSIHLQQTRY DVRGEQALDT PWFEKVRVSL GYADYEHAEV SVEDGQVGTR FLSHGTEGRV
ELVHREHDGH QGAIGFQALD RHFSAIGDEA FVPSTDIKEY GIFTLQRLDR GTWGIDAGLR
FDTRSLQTPT EKRDFDNVSG SIGVFLKPSD SLFYALTLSR NGRAPTEFEL FANGPHPGTG
GFEVGDNKLD NETVTSLEAT VRWKSDRLRA EGHLWAAKYG SFIEEAPTGA VEDNLPVYQY
FQTKADFHGA ELEASYDAWR GATQSLRLET TFDWVHGDTD AGVPARIPPW SLGGSVVWNV
PRVETTLEVR RVAGQDRVAQ FELPTDGYTV VNLKATFKPS ETSPLRLFID GRNLTNAEIR
EHASFLKDIA PSPGRSVRAG VAWKF