Gene Caul_4549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4549 
Symbol 
ID5902010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4923871 
End bp4925172 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content66% 
IMG OID641565068 
Productphosphate ABC transporter, inner membrane subunit PstA 
Protein accessionYP_001686167 
Protein GI167648504 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0581] ABC-type phosphate transport system, permease component 
TIGRFAM ID[TIGR00974] phosphate ABC transporter, permease protein PstA 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.429407 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACG CCGCCATCAA ACCCGGCGCG CCGGCCGCTC GCCCGGCCCT GTCGGCCCGC 
GAGGCCCTGC TCAAGAAGCG CCACCGCTCC GAGACCTGGT TCCGGGTCCA GGGCATCGCG
GCGATCGTCA TCGCCATGAT CTTCCTGGTC ATGCTGGTGG GCCGCATCGT CGCCCAAGGC
TACTCGACCT TCGAGACCCA CACCCTGACC GTGCCGGTCT ATCTGAACCC CGAGCGCATC
GACACGACCG CGCTGGAAGG GGTCAATTAC GACTACATTG TCGCCGAGGC GATGATGAAG
AAGCTGGGCG TGCAGGACGA CGACCTGGGC ACGACGTCGG GCAAGATCAT GGACCTGACC
TCGCGCGACT TCGGCAGCCA ACTGCTGCAG ATGATCAAGA AGGACCGCTC GCTGATCGGC
AAGACGGTCA ATGTCACCGG CTCGGTCAAG GCCGACGCCG ATCTCTACTA TAAGGGCGAG
ATCCAGCGAT CGACCGCCGA GGGCGACCGC AAGCTCGACA ACCAGCAACT GGACTGGCTG
GACAAGCTGA AGAACGAGGG CACGGTGAAG ACCGGCTTCA ACATCAAGTT CTTCACCAAC
TCCGACTCCA CCGAGCCTGA ACAGGCCGGC GTCTGGGGCG CGGTGATCGG CTCGGCCATG
ATGCTGATCA TCACCGCGAC GATCGCCATT CCGGTCGGCG TGATGGCCGC GGTCTACCTG
GAAGAGTTCG CCCCGAAGAA CCGCTGGACC GACGTGATCG AGGTCAACAT CAACAACCTC
GCCGCCGTGC CTTCGATCGT CTACGGCCTG CTGGGCCTGG CCCTGTTCAT CAACTGGCTG
CATGTGCCGC GCGGCTCGCC GCTGGTCGGC GGCCTGGTGA TGGCCCTGAT GGCCCTGCCG
ACCGTGATCA TCGCCACCCG CTCATCGCTG AAGGCCGTGC CGCCCTCGAT CCGCGAAGCC
GCCCTGGGCG TTGGCGCGTC CAAGGCCCAG ACGGTGTTCC ACCACGTGCT GCCGCTGGCC
ATGCCCGGCG TGATGACCGG CGCCATCCTG TCGCTGGCCC ACGCCCTGGG CGAAACCGCG
CCGCTGCTGA TGATCGGCAT GGTCGCCTTC GTGCCTGGCG CCCCGGAGAG CTTCACCAGC
TCGGCCACGG TGCTGCCGGT CCAGGTGTTC ATCTGGGAAA ACGCCTCGGA GCGCGCCTTC
CATGAACGCA CCGCAGCGGC CATCATCGTG CTGCTGGTCT TCATGATCGT CATGAACGCC
GCCGCCGTGA TCCTGCGTCG CCGCTTCGAG CGCCGGTGGT AG
 
Protein sequence
MTDAAIKPGA PAARPALSAR EALLKKRHRS ETWFRVQGIA AIVIAMIFLV MLVGRIVAQG 
YSTFETHTLT VPVYLNPERI DTTALEGVNY DYIVAEAMMK KLGVQDDDLG TTSGKIMDLT
SRDFGSQLLQ MIKKDRSLIG KTVNVTGSVK ADADLYYKGE IQRSTAEGDR KLDNQQLDWL
DKLKNEGTVK TGFNIKFFTN SDSTEPEQAG VWGAVIGSAM MLIITATIAI PVGVMAAVYL
EEFAPKNRWT DVIEVNINNL AAVPSIVYGL LGLALFINWL HVPRGSPLVG GLVMALMALP
TVIIATRSSL KAVPPSIREA ALGVGASKAQ TVFHHVLPLA MPGVMTGAIL SLAHALGETA
PLLMIGMVAF VPGAPESFTS SATVLPVQVF IWENASERAF HERTAAAIIV LLVFMIVMNA
AAVILRRRFE RRW