Gene Caul_1360 details


Gene Information

Locus tag: Caul_1360
Symbol:
ID: 5898815
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1444902
End bp: 1447334
Gene Length: 2433 bp
Protein Length: 810 aa
Translation table: 11
GC content: 66%
IMG OID: 641561847
Product: TonB-dependent receptor
Protein accession: YP_001682988
Protein GI: 167645325
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
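
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal consistency check, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1444902
end_bp = 1447334

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; assume the last codon is a stop codon
# that does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions, the computed values agree with the listed gene length (2433 bp) and protein length (810 aa).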


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGGAC AATCGCGCCT ACGGCTGACA CTGGGGGTGT CGGCGGTCGC GCTGCTCGCG 
GCGCTGCCCG CGGCGGCCCA AACCCAAGAC CGGCCGCGGG ACGACGCCCT GACGATCGAG
ACCCTGGTGG TCACGGCCCA GCGCCGCGAG GAAACCATCA ACAGCGTCGG CATGCCGATC
CAGGCCTTCA GCGGCGCGAC CCTGCAGCAA CTCAGGATCA CCGATCCCAA GGACCTCTCG
ACCATCGCGC CCAGTTTCAC GGTCAGCCAG AGTTACCAGG GCGTGCCGAC CTACACCCTG
CGCGGCATCG GCTTCAACAC GATCAACCTG TCGGCCACTT CGACGGTCGG GACCTATACG
GACGAGGTCG CTTACGCCTA TCCGTTCATG AACACCGGGC CGATGTTCGA CCTGGAGCGG
GTGGAAGTGC TCAAGGGACC CCAGGGCACC CTCTACGGCC GCAACACCAC CGCGGGGCTG
ATCGATTTCG TCACCAACAA GCCGACCGAA CAGTTCGAAG CCTCGCTGAC CGGCGAGGCG
GGCGGCGACA AGACCCACAA TGTCGAAGGC TATGTCAGCG GCGCCATCGC CCCGCGACTG
CAAGGCCGCT TCGCTTTCCG GAGCGAGGAC AGCGACGAAG GCTGGCAGGT CAGCAACACG
CGCAACGAGC GGCAGGGCGA GGTCCACCGC GACGGCTGGC GTCTGTCCCT GGCGGCGCAG
CCGACCGACA AGATCGACAT CGACTTCTCC TATGCGGGCT GGCGCAACGA CTCCGACACC
GTGGCCGCCC AGGCGATCGG CTTCACGCCC GCCACCGCCG CCAGTCCGTT CAACGCCCCG
GGTCTGGTCG CCTATGTCGC CTCGCATCGG CCGACCAAGG CCAGCCAGGG CGACTGGGCG
CCGCTGTCGA CCCGCGGCGC CGACATCGGG GCCGGCCTGG GCATCGGCGA TCCCGCCCGC
GAGAACGACG CCTTCGACGC CGGCAAACTG CGGATCGGCT GGGATCTGGG CGACAAGGTC
AGGCTGGTCT CGCTGACCAG CTACAACAAG TTGACCCGCG ACGCGGTGTT CGACTGGAGC
GGCGCGCCGT ACGAGATTCT GGTCCAGAAG GCCAAGGGCG AGATCGAGTC GAGCGCCGAG
GAACTGCATC TCGAGGGCGC CACCAACAAG GGCTCGTGGC TGGTGGGAGC CTATGTCGCC
CACGACGAGA TCCTGGACAG CAACCGCACC CTGCTGGGTC AGAACGCCAA TGTCGGGACG
ATCCGCTTCT ACGGCGCGGG CCTGCTGGCC TCGCCGTTCA ACAGCGGCGG CTACACGCCG
CTGCAGATGA GTCAGGCGTT CCGCACCTAC GAGGACGTCG GCAGTATCGA GACCGACACC
TGGAGCCTCT TCGCCAACGC CGACCACGCC CTGACAGACA CCCTGAAGCT GACCTTGGGA
ATCCGCTACA GTCAGGACAA GCAGGACTAT GTGGGCTGCT CCCGCGACTT CAACGGCGAC
ATGCTGCCCA ACGTCAACGT CGTGAACCGG GCGCTGTTCT TCGCCGCCTA CGGGCTGGTG
GCGCCGATCG TCCAGGGGGG ATGCAACACC TTCGACCCGG CCACGAAGAG CTTCGGCTTC
GTGAAGTCGA AGCTGGACGA GGACAACATC GCCTGGCGGG TCGCGCTGGA CTGGCGCGCG
TCCGAGGACG TGCTGGTGTT CGGCTCGGTC TCGCGCGGGG CCAAGGCCGG CGTCACCCCG
ATCAACGCCG CGAACATCTC CACCCAGAAC GCCCCGGCGC GACAGGAAAT GCTGACCGCC
TACGAGCTGG GCGTGAAGGC GGGCCTGTTC CAGCGCCGGG TTCAGGCCAA TGTCAGCGCC
TTCTATTACG ACTACACCGA CAAGCAGCTG AATGTGTATT TCGCCGATCC GATCTACACC
GCCCTCGCCC GTCTGGCCAA CGTGCCCGAC GCCGAGGCTC ACGGCGTCGA TGGCGACCTC
ACCTGGCGCG CTTCGCGGTC CCTGACCTTG ATCGCCTCGG CGACTTGGCT GCACACCGAA
GTCAAGGGCT ACACCGGCGT CAACTCCGCC GGCCAGCTCC AGAACTTCGA TGGGCAGCCC
TTTCTATACA GCCCAAAATT CCAGGGCGGC CTTACGGCCA TGTTCGACCG TCCGGTCGGC
GACGGCCTGC GTTTGAGGGC GGCGGTCAAC GGCCGCTGGC AAGGCAAGTC GCAAGCCGAT
CTGGAGGGCA ATCCGCTGTT CGTCATCGAC AGCTACGGGC TGCTCAACGC CAGCGTCGGC
GTCGCCTCGG ACAAGGGCTG GGAGTTGTCG ATCTGGGGCC GCAACCTGAC GGACGAATAC
TATTGGAGCG CGGTGAGCAG CAACGCCAAC ACCGTCGTCC GCTTCCCCGG CAAGCCCAGG
ACCTATGGCG CGGCCCTGAC CTGGAAATTC TAG
 
Protein sequence
MPGQSRLRLT LGVSAVALLA ALPAAAQTQD RPRDDALTIE TLVVTAQRRE ETINSVGMPI 
QAFSGATLQQ LRITDPKDLS TIAPSFTVSQ SYQGVPTYTL RGIGFNTINL SATSTVGTYT
DEVAYAYPFM NTGPMFDLER VEVLKGPQGT LYGRNTTAGL IDFVTNKPTE QFEASLTGEA
GGDKTHNVEG YVSGAIAPRL QGRFAFRSED SDEGWQVSNT RNERQGEVHR DGWRLSLAAQ
PTDKIDIDFS YAGWRNDSDT VAAQAIGFTP ATAASPFNAP GLVAYVASHR PTKASQGDWA
PLSTRGADIG AGLGIGDPAR ENDAFDAGKL RIGWDLGDKV RLVSLTSYNK LTRDAVFDWS
GAPYEILVQK AKGEIESSAE ELHLEGATNK GSWLVGAYVA HDEILDSNRT LLGQNANVGT
IRFYGAGLLA SPFNSGGYTP LQMSQAFRTY EDVGSIETDT WSLFANADHA LTDTLKLTLG
IRYSQDKQDY VGCSRDFNGD MLPNVNVVNR ALFFAAYGLV APIVQGGCNT FDPATKSFGF
VKSKLDEDNI AWRVALDWRA SEDVLVFGSV SRGAKAGVTP INAANISTQN APARQEMLTA
YELGVKAGLF QRRVQANVSA FYYDYTDKQL NVYFADPIYT ALARLANVPD AEAHGVDGDL
TWRASRSLTL IASATWLHTE VKGYTGVNSA GQLQNFDGQP FLYSPKFQGG LTAMFDRPVG
DGLRLRAAVN GRWQGKSQAD LEGNPLFVID SYGLLNASVG VASDKGWELS IWGRNLTDEY
YWSAVSSNAN TVVRFPGKPR TYGAALTWKF