Gene Caul_0226 details

Gene Information

Locus tag: Caul_0226
Symbol:
ID: 5897500
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 242157
End bp: 244394
Gene Length: 2238 bp
Protein Length: 745 aa
Translation table: 11
GC content: 66%
IMG OID: 641560710
Product: TonB-dependent receptor
Protein accession: YP_001681861
Protein GI: 167644198
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
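
The coordinates and lengths listed in this record are mutually consistent, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 242157
end_bp = 244394
reported_gene_length = 2238      # bp
reported_protein_length = 745    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

Under these assumptions both computed values agree with the reported ones (2238 bp and 745 aa).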


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.385209
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.613725
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGA TGCTCAAGCT CGCGCTGCTG GCTGGCGCCG CCTGGTCCGC GGCCGCCACC 
ACCGTGGCCG CTCAAGACAC CGCGCCGGCG CCGACCTCGG ACGGCTTCGC CATTGAACAG
GTCGTCGTCA CGGCACGTCG CCGCGAGGAA AGCCTGCAGG ACGTGCCTGT CGCGGTCTCG
GCCTTTTCGG CCGCGCGGCT GGAGCGAACC GGCGCTCAGG ACATCACCGA ACTGACGCGC
TCGGCGCCCA GCCTGACCAT TCAGGCGGCG CGCGGTTCGA ACTCGACGCT GATCTCGTTC
ATTCGCGGCA TCGGTCAGCA AGACCCGTTG TGGGGCTTCG AGCCCGGCGT CGGCCTCTAT
GTCGATGACG TCTACATCGC CCGCCCGCAG GCTGCCGTGC TGGACATCTT CGACATTTCC
CGCATCGAAG TGCTGCGGGG GCCGCAGGGC ACCCTCTACG GCCGCAACAC CATCGGCGGC
GCCATCAAGT ACGTGACCGA CAAGATCGGA TCGGAAAACG AGGCGACGAT CAAGGGGGCC
TACGGCTCAT ACAATCAGCG CGAACTGGTC GCCAGCGGCA AGGCGCGCCT CACAGACACC
TGGGCGGTGT CCGGCGCCTT GGCCCGCTAT CTGCGCGACG GCTACGGCAA GAACCTCAAC
ACCGGGGCCG AGCACTACAA CAAGGACGTC TGGGCCGGCC GGGCCAGCGT CGAATGGCAG
CCGACGCAAG ACGTGTTCTT CCGCCTGGCG GGCGATATCA CCCGTGACGA CTCCAACCCC
CGCCACGGGC ACCGTGAGAT CGCGCCGATC CCCTCCAGCG TCTACGACAC CAACGCCGGG
GCCGGCGACA AGAACAAGGT CGAGGCGCGC GGCGTGTCGT TGCTCGCCCA GTGGGACGTC
GACGACCAGT TGACCCTGAA GTCGATCACC GCCTATCGGG CGGGCGAGAC CGACGGCGTC
ATCGACTTCG ACAACCTTCC CGGGCCGCTG CTCGACATCC CGGCCGCTTA TCGCGATCAC
CAGTTCTCGC AGGAACTGCA GGCGCTGTAC GAAGGCGACA GAATCCATGC CGTGGGCGGC
GTCTACTATC TGAGCGCCAC CGCGTCGGGC GCCTTCGACA CCGTGGTGGG GGGCGCCAAC
CTGACAACCC TGACCCAGGG TTATGTCGAT ACCGAGAGCG TCTCGGCCTT TGGCGACGTC
AGCTACGATC TCACCGATCG CCTGTCGCTC TCGGTCGGGG GGCGCATCAC CCGCGACAAG
AAGACCGGCA ATGTCTTCCG GCAACGGTAT CTCGGAATCC GCAGCCCGTT CTTCGGCAAT
CCCGCCGCCA TCGCCTTCGA GGCCCCGCGC ACCAACTACA CGCGCACGGC CACGTTCGAG
AAGTTCACGC CTCGCGTCAG CGCCAGTTAC AAATTCTCGC CGGACCTGAC GGGCTATGCC
TCGTGGGGCA AGGGCTTCAA GTCGGGCGGT TTCGACATGC GCGGCGACAA GGTCGCCTAT
CCGGCCACCG ACCAACCCTA CAGCCCCGAG AACGTCGAGA CGGTGGAGCT GGGCCTGAAG
GGCTCGCTCA TGGATCGTCG CGTGACCTTC GCCACGGCGG TGTTCGACAC CAACTACAAA
GATATGCAGA TCACTACCCA GTTTCCGACC GCCACGCCGG GCGTCGTCGC CTCGGTGGTG
GACAATGTCG GCAGCGCCTC GATCCGGGGC TGGGAGCTGG AGAGTTCCGC GGTCATCAGC
TCCAGCTTCG TCGCAAACCT GATGCTCAGC TACATCGACG CCAAGTTCGA CCAGTTTCTG
GGCTACGTGC CGACCGGCCC GGCCAACGCC AGCTGCCCGA CCCTGCCCGG CTGTGTCGTC
GACTTGTCGG CCGTGCGAGC CTTCCAGAAC ACCCCCGAAT GGACCGGTTC GGCCAGCTTC
ACCTACACCC ACGACATGGG TTCCAACGGC AAGATATCGT TCACGCCGAC GGCGTCCTAT
CGCGGCGCCT ACCAGTTGTT CGAGGCGCCG CAGCCTATCC TCGACCAAGG CGCCTACTGG
CTCTATGACG CCAGCCTGGT CTGGACCTCG GCCGATGATC GCTACCAGAT CGGCCTGCAC
GGCAAGAACC TGGGCGACGA GGAGTATCGC GTCGGTGGCT ACGATTTCAG CTCCTTCGGC
GCCCTGACCG GCAATACGGT GATCGGCTTC TACGGCCCGC CGCGGTCGGT GACCCTGTCG
CTGCAAGCCA AGTTCTAG
 
Protein sequence
MNKMLKLALL AGAAWSAAAT TVAAQDTAPA PTSDGFAIEQ VVVTARRREE SLQDVPVAVS 
AFSAARLERT GAQDITELTR SAPSLTIQAA RGSNSTLISF IRGIGQQDPL WGFEPGVGLY
VDDVYIARPQ AAVLDIFDIS RIEVLRGPQG TLYGRNTIGG AIKYVTDKIG SENEATIKGA
YGSYNQRELV ASGKARLTDT WAVSGALARY LRDGYGKNLN TGAEHYNKDV WAGRASVEWQ
PTQDVFFRLA GDITRDDSNP RHGHREIAPI PSSVYDTNAG AGDKNKVEAR GVSLLAQWDV
DDQLTLKSIT AYRAGETDGV IDFDNLPGPL LDIPAAYRDH QFSQELQALY EGDRIHAVGG
VYYLSATASG AFDTVVGGAN LTTLTQGYVD TESVSAFGDV SYDLTDRLSL SVGGRITRDK
KTGNVFRQRY LGIRSPFFGN PAAIAFEAPR TNYTRTATFE KFTPRVSASY KFSPDLTGYA
SWGKGFKSGG FDMRGDKVAY PATDQPYSPE NVETVELGLK GSLMDRRVTF ATAVFDTNYK
DMQITTQFPT ATPGVVASVV DNVGSASIRG WELESSAVIS SSFVANLMLS YIDAKFDQFL
GYVPTGPANA SCPTLPGCVV DLSAVRAFQN TPEWTGSASF TYTHDMGSNG KISFTPTASY
RGAYQLFEAP QPILDQGAYW LYDASLVWTS ADDRYQIGLH GKNLGDEEYR VGGYDFSSFG
ALTGNTVIGF YGPPRSVTLS LQAKF
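
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The following is a minimal Python sketch of that translation, assuming the sequences are pasted with the 10-character spacing shown here. Table 11 uses the same amino-acid assignments as the standard genetic code (it differs in the permitted start codons), so a standard codon table is built by hand. The helper names `clean` and `translate` are hypothetical, and only the first and last lines of the gene sequence are used for illustration.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 shares these amino-acid assignments with table 1.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence in this record (both in frame,
# because every full line holds 60 bases, a multiple of 3).
first_line = "ATGAACAAGA TGCTCAAGCT CGCGCTGCTG GCTGGCGCCG CCTGGTCCGC GGCCGCCACC"
last_line = "CTGCAAGCCA AGTTCTAG"

print(translate(first_line))  # MNKMLKLALLAGAAWSAAAT
print(translate(last_line))   # LQAKF
```

The two outputs match the first 20 and the last 5 residues of the protein sequence. Passing the full gene sequence to `translate` in the same way would be the natural way to check the entire record.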