Gene Caul_1474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1474 
Symbol 
ID5898929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1566281 
End bp1568428 
Gene Length2148 bp 
Protein Length715 aa 
Translation table11 
GC content68% 
IMG OID641561961 
ProductTonB-dependent siderophore receptor 
Protein accessionYP_001683102 
Protein GI167645439 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4773] Outer membrane receptor for ferric coprogen and ferric-rhodotorulic acid 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.052337 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGTCCA AAGCCCTGCT GCTCGCCCTG CTGGCGACCG CCGTTCCGTT GTACGCGATG 
GCGGCGGACG CCCCGGCCGA CGACAAGACC TCGGTGGACG GCGTCACCGT CACCGCCCGC
GCCCACGACA CCGTCGGCTC GACCGGCAGC AAGCTGGCCA CGCCCCTGGT CGATACCCCG
CAGTCGGTGG CGGTGATCAC CGGCGAGCGC ATCGACCAAC TGGGCCTGCA GTCGCTGAAC
CAGGCGCTGC GCTACACCGC CGGCGTCACG CCCGAGACCC GCGGCGGCGT GGTCACCCGC
TACGACCAGT TCAAGCTGCG CGGCTTCGAC GTCAACGCCA CCTTCCTGGA CGGCCTGAGC
AACCTCTATC CGGGCTGGTA CGCCGACGCC CAGGTCGACG CCTCGACCGT CGACCGCATC
GAGATCCTCA AGGGCCCGGC CTCGGTGCTC TACGGCAACT CGCCCCCCGG CGGCCTGATC
AACTATGTCA GCAAGACCCC CAGGGAGGTC GCGGGCGGCG AGATCGAGGT GCGGGCGGGT
AACAACAAGC TGGTCGAGGC GTCGATCGAC ACCACCGGCC CGATCGCCGG CGACACGCGC
TACACCTACC GCCTTGTGGC CATGGCCCGT CAGGGCGACG GCCAGGCCGT GACCACCGAG
CACCAGCGCT ATGTGGTCGC TCCGTCCTTC ACCTGGCGCC CGGACGAGGC CACGACCGTG
ACCGTTCTGG GCCGCTACCA GCACGATCCC AAGGCGGCCA GCTATGGCGG CGCGCCCTCG
GAGGGCTCGG CGTTCAAGAA CCCGCTGGGC CAGCTGCAGC CCGACTTCTA CGACGGCGAC
CCGAACTTCG AGGCCTACAA CCGCACCCAG GCCACCATCG GCTATCTGGC CGAGCACAGG
TTCAACGACA TCTTCGCCGT CCACCAGAAC CTTCGCTACA GCCGCGTCGA GAGCAATTAC
GAGTCGGTCT ACGCCACGGG CCTGGACGCC AACGACCGCA CCCTGCACCG CGCCACCGCC
GCCTCGCTGG AGAGCGTCGA CGGCTTCGTG GTCGACAACC AAGCCAGCGC CCACTTCACG
ACCAGCGCCT TGACCCACGA CGTGCTGTTG GGCCTGGACT ACCAGCACGC CATGGCCAAG
GTGCGGTCCG GTTTCGGCGC CGCGCCGGAC CTGGACATCT TCGCCCCGGT CTATGGTCAG
CCGATCATCG ATCCACGCGG CGACCCGACG GCCTATCGGT CGGACATGCG CATCAAGCAG
GAGCAGACCG GCCTCTATCT CCAGGACCAG ATCAAGCTGG ACAAGCTGAT CGTGCTGGTC
GGCGTGCGCC GCGACAGCCT CAAGCAGGAC ACCACGACCC TGGGAGCCTT CGGCGCCACG
ACGGTGATCG ATCAGGATCA CGCCAGCGGT CGCGTCGGCA TGCTCTATCA CTTCGACAGC
GGCTTCGCGC CCTATGTCAG CTGGTCGCAG TCGTTCGAGC CCCAGGGCCC TTACGGCACG
CGCACCTTCA AGCCCATCAC CGGCGACCAG ATCGAGGCCG GCGTGAAGTA CGAGTCGCCG
GACAAGAAGA TCTACGCGAC CCTGGCCGCC TTCGAGCTGA AGCGCCAGGA CGTGCTGTCG
CCCGATCCGG CCAACACCAA CGAGAGCATC CAGGGCGGCG AGGTGCGCTC GCGCGGCGTC
GAGTTCGAGG GGCGCGCCAA GCTGACCTCG CAGCTGTCGC TGTCGGGCGC GGCGACCTGG
CTCGACGTCG AGAACACCAA GGACATGCTC GCCACCGCGG ACTACGTCAC CTACTTCAAC
CTGAAGGGCC GCGCGCCGGT CGGCGTGGCC AAGAAGACCG CCTCGGTGTT CGCCGACTAC
GACTTCGACG GCGGCTTGGC CGGCCTCGGC GTCGGCGCCG GCGTGCGTTA TGTCGGCTCC
AGCTGGGGCA ACCCGATCAA CAGCTTCAAG GCCCCGGCCT ACACCCTGGT CGACATGAGC
CTGAGCTACG ACCTGGGCCA GATGAGCGAG GGCCTGAAGG GCTGGAAGGC CATGGCCAGC
GCCACCAACC TGTTCGACAA GCGCTATGTC TCGTCCTGCT ATTCCGACGC CTGGTGCTGG
TTCGGCGCCC AGCGCTCGGT GCAGGTCGGC CTCAAGCGCA GCTGGTAG
 
Protein sequence
MKSKALLLAL LATAVPLYAM AADAPADDKT SVDGVTVTAR AHDTVGSTGS KLATPLVDTP 
QSVAVITGER IDQLGLQSLN QALRYTAGVT PETRGGVVTR YDQFKLRGFD VNATFLDGLS
NLYPGWYADA QVDASTVDRI EILKGPASVL YGNSPPGGLI NYVSKTPREV AGGEIEVRAG
NNKLVEASID TTGPIAGDTR YTYRLVAMAR QGDGQAVTTE HQRYVVAPSF TWRPDEATTV
TVLGRYQHDP KAASYGGAPS EGSAFKNPLG QLQPDFYDGD PNFEAYNRTQ ATIGYLAEHR
FNDIFAVHQN LRYSRVESNY ESVYATGLDA NDRTLHRATA ASLESVDGFV VDNQASAHFT
TSALTHDVLL GLDYQHAMAK VRSGFGAAPD LDIFAPVYGQ PIIDPRGDPT AYRSDMRIKQ
EQTGLYLQDQ IKLDKLIVLV GVRRDSLKQD TTTLGAFGAT TVIDQDHASG RVGMLYHFDS
GFAPYVSWSQ SFEPQGPYGT RTFKPITGDQ IEAGVKYESP DKKIYATLAA FELKRQDVLS
PDPANTNESI QGGEVRSRGV EFEGRAKLTS QLSLSGAATW LDVENTKDML ATADYVTYFN
LKGRAPVGVA KKTASVFADY DFDGGLAGLG VGAGVRYVGS SWGNPINSFK APAYTLVDMS
LSYDLGQMSE GLKGWKAMAS ATNLFDKRYV SSCYSDAWCW FGAQRSVQVG LKRSW