Gene Caul_2195 details


Gene Information

Locus tag: Caul_2195
Symbol:
ID: 5899650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2387293
End bp: 2388312
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 68%
IMG OID: 641562687
Product: sulfate ABC transporter, ATPase subunit
Protein accession: YP_001683821
Protein GI: 167646158
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1118] ABC-type sulfate/molybdate transport systems, ATPase component
TIGRFAM ID: [TIGR00968] sulfate ABC transporter, ATP-binding protein
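
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2387293
end_bp = 2388312

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1020 0 339
```

Under these assumptions the result agrees with the listed gene length of 1020 bp and protein length of 339 aa.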


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.643313
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.295178
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATCT CCATCCGTTC CGTCGAAAAG AAGTTCGGGC GCTATCCGGC GCTCAACAGC 
GTCGATCTGG AGATCGCCGA CGGCGAACTC GTGGCGCTGC TGGGGCCGTC CGGCTCGGGC
AAGACCACTC TGCTGCGAAC GATCGCCGGC CTGGAGTTCC CGGACAAGGG CCAGGTGCTG
TTCGAGGGCG AGGACGTGAC GTTCGCCTCG GCGGCGGCGC GGCGGGTGGG CTTCGTGTTC
CAGCAGTATG CGCTGTTCAA GCACATGACC GTGGCCAAGA ACATCGCGTT CGGCCTCGAC
GTCCGCAAGG GCAAGGACAA GCCCGACAAG GCCGAGATCG CCCGCCGCGT CGAAGAGCTT
CTGAAGCTGG TCGAGCTGGA CGGCCTGGGC AAGCGCTACC CCTCGCAACT GTCGGGCGGC
CAGCGGCAGC GCGTGGCCCT GTCGCGCGCC CTGGCGGTGC AACCCAGCGT GCTGTTGCTC
GACGAGCCGT TCGGCGCCCT GGACGCAACG GTCCGCAAGT CGCTGCGCAA GGAGCTGCGC
CGGGTGCATG ACGCCACCGG CGTGACCACC ATCTTCGTCA CCCACGACCA GGAAGAAGCG
CTGGAACTGG CCGATCGCGT GGCCATCCTC AACGCCGGCC GTATCGAGCA GATCGGCACG
CCGCACGAGG TGCACGACAA TCCGGCCACG CCGTTCGTCT GCGGCTTCGT CGGCGAAGCC
AACCGGTTCG AGGGAACGGT GTCGGGCGGA CGGTTCACGG CCGGGCCGGT GACGCTGCCG
GCGCCGCAGG CCGCCAACGG CGCGGCCGTG GCTTTCGTGC GGCCCCACGA CGTAGTGTTG
GACGCGGCGG GCTTCCCGGC CAAGGTTGAG CGGGTGGTGA TCCAGGGTCC ACTGGCCAAC
ATTGACGCCT CGCTGCCTGA CGGTCGCCGC ATCGAGATTT GCGCGGCCCG CGACGAGGCC
GCCAACTTCT CGGGTGAGGT CAGGCTTTCG GCCCGACGGA CGCACGTCTT CGCGGTTTAG
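
The gene sequence is printed in blocks of 10 bases, so it needs cleaning before any computation such as the GC content listed in the gene information. The following is a minimal sketch; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for layout and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACCATCT CCATCCGTTC CGTCGAAAAG AAGTTCGGGC GCTATCCGGC GCTCAACAGC"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases on this line
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

Passing the full 1020-base sequence to `gc_fraction` instead of a single line gives the value to compare against the record's GC content field.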
 
Protein sequence
MTISIRSVEK KFGRYPALNS VDLEIADGEL VALLGPSGSG KTTLLRTIAG LEFPDKGQVL 
FEGEDVTFAS AAARRVGFVF QQYALFKHMT VAKNIAFGLD VRKGKDKPDK AEIARRVEEL
LKLVELDGLG KRYPSQLSGG QRQRVALSRA LAVQPSVLLL DEPFGALDAT VRKSLRKELR
RVHDATGVTT IFVTHDQEEA LELADRVAIL NAGRIEQIGT PHEVHDNPAT PFVCGFVGEA
NRFEGTVSGG RFTAGPVTLP APQAANGAAV AFVRPHDVVL DAAGFPAKVE RVVIQGPLAN
IDASLPDGRR IEICAARDEA ANFSGEVRLS ARRTHVFAV
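
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the gene sequence is given in coding orientation starting at the first base; `CODON_TABLE` and `translate` are illustrative names. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons, so the standard assignments are used here.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop layout spaces and line breaks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGACCATCT CCATCCGTTC CGTCGAAAAG"))  # prints: MTISIRSVEK
print(translate("GCGGTTTAG"))                         # prints: AV
```

The two outputs match the first ten and last two residues of the protein sequence listed above.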