Gene Caul_1672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1672 
Symbol 
ID5899127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1757297 
End bp1759582 
Gene Length2286 bp 
Protein Length761 aa 
Translation table11 
GC content65% 
IMG OID641562162 
ProductTonB-dependent receptor 
Protein accessionYP_001683299 
Protein GI167645636 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.295189 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.636127 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAGT TCGGTACCCA GAAGATTGGT CTGCTGACAG GGGCGGCCAT TTCGGCGCTG 
TCGTTCAGCG CCGCCCTCGC GCAGGAAGCG GCTCCGACCG CCCTGGACGA GATCGTCGTC
ACGGCCGAGC GCCGCTCGGA AAACCTGCAG AAGGTGCCGG TGTCGGTGGC CGTGGTCGCC
GGCGACCAGC TGCGCGCCAT CCAGGGCGGC GGCGACGACA TCCTCTCGCT GTCGGGCAAG
GTGCCCAGCT TCTATGCCGA GACCACCACG GGCCGGATCT TCCCGCGCTT CTACATCCGC
GGCCTGGGCA ATATCGACTT CTATCTCGGC GCTTCGCAGC CGGTGTCGAT CATCCAGGAC
GACGTCGTGC TCGAGCACGT GGTCCTGAAG TCCAACCCGC TGTTCGACGT CAAGCAGGTT
GAAGTGCTGC GCGGCCCGCA GGGCTCGCTG TTTGGCCGCA ACACCACGGC CGGCATCGTC
AAGTTCGACA CCAACCGTCC GACCGAGACC CTCGAAGGCC GCGCCAGCGC CTCCTACGGC
ACCTACGGCA CGACCACCTT CGACGGCGGC ATCGGCGGCC CGATCGCCGG CGACAAGCTG
ATGTTCCGCC TGTCGGCGCT GTACCAGCAC CGCGACGACT ATGTCGACAA CACCTTCGCC
GGCACGAGCG CCGACGGCAC CGTCACGCCG AAGAAGAACG CCATGGGCGG CTTCGACGAA
AAGGACGTGC GCCTGCAGAT CCTGGCCAAG CCGACCGATC AACTGACGGT CCTGGCCTCG
GCCCACGCCC GCAACTACGA GGGCACCTCG ACCCTGTTCC TGCGCCAGGC GCTGAAGAAG
GGTTCGAACA ACTCGACCGC CCCGCGCGAC AGCGTCGCCC TCGACGAGGG CAACAACAAC
CCGCAGGCCT ATGACACCCA GGGCACGTCG CTGAACGTCG CCTACGACTT CGGTCCGGTG
ACCCTCACCT CGATCAGCGC CTACGAGACC ACCAGCGGCT ACAGCCGCGG CGACACCGAC
GGCGGCGCGG CGGCCAACTA TCCGGTCGGC GGCGTTCCCA ACGGCTTTGG CCAGTCCATG
GGCCGGGTTC GCGACCTCGA CCAGCTGACC CAGGAAATCC GCCTGGCCAG CAACGGCGAC
GACCGCCTGA AGTGGCAGGT CGGCGCGCTG TACTTCGACT CGCGCGACAC GACCGACTTC
TATCAGCGCG CCTACTTCAC CAAGACCAAC CCCAACAACT GGGTCGAGCT GAACAACCTC
AACACCTCGT GGGCGGTGTT CGGCCAGGTC AGCTACAAGG TCACCGACGC CCTGACGATC
ACGGGCGGCC TGCGCGACAC CTACGACGCC AAGAAGACCA TCCTGGCCAA GACCGCCAAC
ACCGCCGCCA ACGCCGTGAC CTATGCCGGT CGCCGCTATG TGCGTCTGTC GGACGAACAG
GTCAGCTGGG ACCTGAGCGC CAACTACGAG GTCAATCCCG ACCTGAACCT CTACGCGCGG
GCCGCCAAGG GCTTCCGTGG TCCGACCATC CAGGGCCGCT CGGCGGTGTT CAACAGCGAC
TTCACGACCG CCAATTCCGA AACGATCCTG TCGTGGGAAG CGGGCTTCAA GAGCACGCTG
CTGGACAACA CCCTGCGCCT GAACGCCTCG GCCTTCACCT ATGAGGTCAA GGACATCCAG
CTGAACGGCA ACGATTCCAA CGGCAACGGC GTGCTGTTCA ACGCCGACAA GGCCAAGGCC
TACGGGCTGG AAGCCGACGC CGAATGGCGT CCGGTCTCGA ACCTGACCCT GACGGCCGGC
GTCAGCCTGC TGCACAGCGA GATCCAGGAC AAGCGCGTCT ACGCCCAGGT CTGCGCGCTC
AACAGCGTGG TGGTCTGTAC GGTCAAGAAT CCGACGATCG CGATCGTCGG TCCGTTCGGC
ACCAGCACCT TCGCCCAGAT CGACGGCCAG CCGCTGCCCA ACGCGCCGAA GTACAACTTC
AACTTCACGG CGCGCTACGA CGTGCCGGTC GGGGCCGACG GCAAGCTGTT CATCGCCACC
GACTGGAACG TGCAGGGCTA TACGAACTTC GTGCTCTACG ACACCGACGA GTTCTACTCG
AAGGGCAACT TCGAAGGCGG CCTGAAGCTC GGCTACGAAG GTGGTAACGG AGCCTATGAA
GTGGCCCTGT TTGGCCGCAA CATCACCAAC GAGAAGAACC TCAAGGGCGT GATCGAGAAC
TACATGGCGG CCGTCTATAA CGAGCCGCGC ATCGTGGGCA TCTCGGTCAG CGCGAAGCTG
AAATAG
 
Protein sequence
MRKFGTQKIG LLTGAAISAL SFSAALAQEA APTALDEIVV TAERRSENLQ KVPVSVAVVA 
GDQLRAIQGG GDDILSLSGK VPSFYAETTT GRIFPRFYIR GLGNIDFYLG ASQPVSIIQD
DVVLEHVVLK SNPLFDVKQV EVLRGPQGSL FGRNTTAGIV KFDTNRPTET LEGRASASYG
TYGTTTFDGG IGGPIAGDKL MFRLSALYQH RDDYVDNTFA GTSADGTVTP KKNAMGGFDE
KDVRLQILAK PTDQLTVLAS AHARNYEGTS TLFLRQALKK GSNNSTAPRD SVALDEGNNN
PQAYDTQGTS LNVAYDFGPV TLTSISAYET TSGYSRGDTD GGAAANYPVG GVPNGFGQSM
GRVRDLDQLT QEIRLASNGD DRLKWQVGAL YFDSRDTTDF YQRAYFTKTN PNNWVELNNL
NTSWAVFGQV SYKVTDALTI TGGLRDTYDA KKTILAKTAN TAANAVTYAG RRYVRLSDEQ
VSWDLSANYE VNPDLNLYAR AAKGFRGPTI QGRSAVFNSD FTTANSETIL SWEAGFKSTL
LDNTLRLNAS AFTYEVKDIQ LNGNDSNGNG VLFNADKAKA YGLEADAEWR PVSNLTLTAG
VSLLHSEIQD KRVYAQVCAL NSVVVCTVKN PTIAIVGPFG TSTFAQIDGQ PLPNAPKYNF
NFTARYDVPV GADGKLFIAT DWNVQGYTNF VLYDTDEFYS KGNFEGGLKL GYEGGNGAYE
VALFGRNITN EKNLKGVIEN YMAAVYNEPR IVGISVSAKL K