Gene Caul_1289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1289 
Symbol 
ID5898744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1355602 
End bp1358538 
Gene Length2937 bp 
Protein Length978 aa 
Translation table11 
GC content64% 
IMG OID641561774 
ProductTonB-dependent receptor 
Protein accessionYP_001682917 
Protein GI167645254 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.384181 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0181603 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTTC ATAATTCTCG TCGCAACGCC ATGCTCATGG GCGCGTCAGC GCTCGTCATC 
TGCGGTGCTC TGCCCGCGAT CGCTCAGGCG CAAACAACCG CGCCGGCGTC GTCGGCCGAC
ACCGTCGAGG AAGTCGTCGT CACCGGCCAG CGCGCCGCCC TGCAATCGGC CCAGAAGCTG
AAGCAGAACG CCGAGCAACT GGTCGATTCG ATAACCGCCA CCGATATCGG CGCCTTGCCG
GACCGCAGCG TCACCGAAGC CCTTCAGCGT GTCGCGGGGG TCACCATCGG CCGCACGGCC
GACGGCCGCG ACGCCGACCG CATTTCGGTC GAAGGCAGCG GCGTCCAGGT TCGCGGTCTC
AGTTGGGTCC GCGGCGAAGT GAACGGCCGC GACAGCTTCT CGGCCAAGGC CGGCCGCACC
CTGAGCTTCG AGGACGTTCC GCCCGAACTG ATGTCCGGCG TCGACGTTTA CAAGAACCCT
TCGGCCGACA TCATCGAAGG CGGCGTCGGC GGCACCGTCA ACCTGCGCAC CCGCCTGCCG
TTCGACAGCG GCAAGCGCAA TCTGGCCTAT TCGGCCGACT CCAGCTGGGG CGACCTGTCC
AGGAAATGGG AGCCCAGCGG CTCGGTTCTC TACAGCAACC GGTGGGACAC CAAGATCGGC
GAGCTCGGCT TCCTGATCGA CCTGTCAGAC TCCAAGCTGA GCAGCCGCAC CGACACCATC
TCGGTCGACC CCTATTTCGC GCGCACCAAC ACCCTGGTCC CCGGCAAGAC CGTCTATGTC
CCGGGCGGTT TCGGCTATCG CAGCCTGGAC TTCGAGCGCG AACGCAAGGG CATCGCCGCC
GCCCTGCAAT GGCGCCCGAA CGAGCAATGG GACGCCAGCC TGCAGTTCCT GCGCTCGTCG
GCCTCGCAGG CCTCGACGGA GCACGCGGTC GGTTTCAATC CCGGATCGAC CAACGGGCCG
GCGACCGGCA CGGACTTCAC CTACGATTCG GACGGTCATT TCCTGAAGGG CACGCTTGCC
CAGACGCCGG GCGGCTCGAG CCTGGGATCG TCGACCATCG ACACCCGGTA CTCGGATCGT
AGCTCGGTGA CCTCCGACTA CGCCCTGAAG GTGAAATACA CTCCCAACGA CAAGTGGGCG
TTCAGCGGCG ACATCCAGTA CGTCTACGCC AAGACCAAGA CCGTCGATAA CACCGCGTTC
AACGCGCTGA ACAGCGACGC CGCGCCCGCC AGCCTGGACC TCACCGGCAG CCTGCCGGTG
ATCACCATGA ACAACGACAC GGCCTACACG TCCAACGCCG CCAACTACTA CCTGCAGGCG
GCGATGGATC ACCACGACCG CAACGAGGCG GCCCAGTGGG CCGAGCGCTT CGACGGCGAC
TACAGCTTCG ACGATGGCGG CTGGCTGAAG TCGTTCCGCT TCGGCGTTCG CCACACCTTC
CGCCAGGCGA CCACCCGCGA AACCAACTAC CGTTGGGACA CGGTGGCGCC GAGCTGGTCG
GCGACCGCTC CGATCAGCAC GCTGGACGGC TACCAAGGCT ATTACGGGCT CTACGAATTC
GACAACTACT TCCGCGGCAA GGCCCACCTG CCCGCGACCT TCGTCATGCC GAGCGCGCAG
TTCGTGAACA ACTACGGCGA CACCTCGCTG GTGCTCTCCA AGATCGCGCA GCAGAACGGC
GGCGGCTGGC GGCCCTTCAA CGGTGTCTTT GACGACCAAG GTCAGGCCGG CGGCAAGGGC
AGCATCAACC ACCAGAAGGA AGAGACCCTG GCGGCCTACG GCCTGCTGCG GTTCGGCCAT
GACGTCTCGC TGTGGGGCGA TCAGCGCGAG ATCGACGGCA ACTTCGGCCT CCGCGTCGTC
AAGACCGAAA GCCAGAGCCT GGGCATGCAG GTGTTCACCC CGAACACCAC CAGCACCGAC
ATTCCGGCCG CCGATCAGGC GTTCTCGAAC GGCGCCAAGA GCCCCTACAA GGGCGGTCGC
GACTATGTGA GCGTCCTGCC CAGCCTGAAC GTCCGCCTGA AGATCACGCC GGACATGTTC
ATCCGCTTCG CGGCGGCCAA GGCCATCGTC CGTCCGGACT TCCAGCAACT GCAGCCGAAC
TACACGATCT CGGCCACCAA CGGCTTCATC ACCGGCGGGA CCTGCTCCAG CACCATCCCG
GGCGGCTCTC AGGCCAACTG CGTCTATCAG TACACGGCCA ATGCCGGTAA TCCGGACCTG
AAGCCGACCC GTTCGACCCA GTTCGACGTG TCATACGAGT GGTACTTCAA CTCGACCGGC
AACGTGACGG CGACGGCGTT CTACAAGGAC ATCTACAACT TCGTCACCAA CGGCTCGACG
AACCTCAACT TCACCAACAA CGGCGTGACC CGCACCGTTC AGGTGGTCCA GCCGTACAAC
GCCGGCCACG GCACGATCAA AGGCTTCGAA GTCGCCTACC AGCAATATTA CGACTTCCTG
CCAGGCGTCC TGCGCGGCCT GGGGACGCAA GCCAACTTCA CCTATGTCGA CAGCAAGGGC
TCGCGCAACG CCGCGTCCAA CCCATACGAC ACCAACCAGG TCGGCAATAT CAGAACCAAT
GGGGAGGAGC TGCCGCTCGA GGGCCTGTCG AAGAAGAGCT ACAACGTCGC GGCGCTATAC
GACCTGGGCA AGGTCTCGGC GCGCCTCGCC TACAACTGGC GTGAGCGCTA CCTGCTGACC
ACCACGGCGG CGAACATCAA CATTCCGGCC TGGTACGGCT CCTACGGCCA GCTGGACGGC
TCGGTGTTCT ACACGGTCAA CGACGCGCTG AAGATCGGCT TCCAGGCGGC CAACCTCACC
AACACCCGGA CCAAGATCCT GGTCAGCTAT CCGGGCAAGC CCGAGGAAGG TTTGACGAAC
CACAACTGGG TTGTCGCCGA TCGCCGCTAC TCGATCGTGC TCCGCGGCAC GTTCTAA
 
Protein sequence
MSLHNSRRNA MLMGASALVI CGALPAIAQA QTTAPASSAD TVEEVVVTGQ RAALQSAQKL 
KQNAEQLVDS ITATDIGALP DRSVTEALQR VAGVTIGRTA DGRDADRISV EGSGVQVRGL
SWVRGEVNGR DSFSAKAGRT LSFEDVPPEL MSGVDVYKNP SADIIEGGVG GTVNLRTRLP
FDSGKRNLAY SADSSWGDLS RKWEPSGSVL YSNRWDTKIG ELGFLIDLSD SKLSSRTDTI
SVDPYFARTN TLVPGKTVYV PGGFGYRSLD FERERKGIAA ALQWRPNEQW DASLQFLRSS
ASQASTEHAV GFNPGSTNGP ATGTDFTYDS DGHFLKGTLA QTPGGSSLGS STIDTRYSDR
SSVTSDYALK VKYTPNDKWA FSGDIQYVYA KTKTVDNTAF NALNSDAAPA SLDLTGSLPV
ITMNNDTAYT SNAANYYLQA AMDHHDRNEA AQWAERFDGD YSFDDGGWLK SFRFGVRHTF
RQATTRETNY RWDTVAPSWS ATAPISTLDG YQGYYGLYEF DNYFRGKAHL PATFVMPSAQ
FVNNYGDTSL VLSKIAQQNG GGWRPFNGVF DDQGQAGGKG SINHQKEETL AAYGLLRFGH
DVSLWGDQRE IDGNFGLRVV KTESQSLGMQ VFTPNTTSTD IPAADQAFSN GAKSPYKGGR
DYVSVLPSLN VRLKITPDMF IRFAAAKAIV RPDFQQLQPN YTISATNGFI TGGTCSSTIP
GGSQANCVYQ YTANAGNPDL KPTRSTQFDV SYEWYFNSTG NVTATAFYKD IYNFVTNGST
NLNFTNNGVT RTVQVVQPYN AGHGTIKGFE VAYQQYYDFL PGVLRGLGTQ ANFTYVDSKG
SRNAASNPYD TNQVGNIRTN GEELPLEGLS KKSYNVAALY DLGKVSARLA YNWRERYLLT
TTAANINIPA WYGSYGQLDG SVFYTVNDAL KIGFQAANLT NTRTKILVSY PGKPEEGLTN
HNWVVADRRY SIVLRGTF