Gene Caul_4553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4553 
Symbol 
ID5902014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4931867 
End bp4933312 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content70% 
IMG OID641565072 
Productphosphate-selective porin O and P 
Protein accessionYP_001686171 
Protein GI167648508 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3746] Phosphate-selective porin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.354603 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAAG CCCAGACCGC CCTGCGCACG GCGTTGATCG CCGGAGCCTC GCTGATCGCC 
ATGACCGGCG CGGCCCATGC CCAGACCGTC CAGACGCCTG AACAGCAACA GGCGCGCATC
GAGGCCCTGG AGGCCCAGCT CGAGGCGCTG TCCAGCCAGA TCGCCGACCT GAAGGCCGCG
ACCGCCGCCA GCCTCAAGGA CGTCCGCACC GCCCAGGCGA CGACCACGGT CAGCATCGCC
GGCGGCAAGC CGAGCATCGC CTCGGGCGAC GGCGCTTTCA GCGCCAATAT CCACGCCGTG
ATGCAGCTGG ACGCCACCTG GTACAACCAG GACAGCGGCC TGCCGGCCGC CGTCACCGCC
CGCGACCTCA ACAGCGGCAC GAACTTCCGC CGCGCCCGCC TGGGCGTCGA CGGCAAGGCG
TTCAAGAACT TCGACTACGG CGTGCTGCTG GACTTTGGCG GCTCGGGCAC GGACGGGGCG
GGCGCGCTGC AGGAAATCTA TCTCCAGTAC AACTACGCCC CCTTCAAGGT GAAGGTCGGG
GCCTTCGCGC CGAACCTGGG CCTGGAGGAC GCCGCCTCGA CCAATGGTTC TCTGTTCCCC
GAGCGCCCCG CCGCCGCCGA GGCCGCCCGC GGCCTGGCCG GCGCCGACCG CCGCATCTCG
CTGCAGGCCC AGGCCGTCGG CGAGCGCTGG ATCGTCTCGG GCGCGGTGAC CGGCGCCAAG
GCCGGCGACG GCGCCACCTT CGACGAACAG CTGGGCTATG TCGGCCGCGT CGCCTTCATC
CCGTTCAGGG GCTATGACTG GCTGACCCAC GTGGGCGTCA ACGCCAGCCG CATCGCCCAG
CCCGCCCAGA CCGCCGTGGC CGGCGCCTAT CCGATCACCG TCGAGGACCG CCCCGAGCTG
CGCACCGATG GGACACGCCT GGTCAGCTCC GGCGCGATCG ACTCGGCCGG CGCCCGCCAC
TACGGCTTCG AACTGGCCGC CCAGAAGAAG AATTTCCTGA TCCAGGGCGA GTATTTCGAC
ATCGCGCTGG ACCGCCGCAA CCGCGCCGCC AACGTCACCG ATCCGAAGTT CAGCGGCTGG
TACGTCGAGG GCGGCTGGGT GCTGACCGGC GAGCAGCGCA AGTACAATGC CGCCAACTTC
GCCTTCGACG CGCCCGCTAT CGCCAATCCT TTCGACCCCA GGGCCGGCAA GTGGGGAGCC
TGGGAGCTGG CGGCCCGCTA CTCGGTGCTG GACGTCAACC ACCACGAGTA CGCGACCGTG
GCCGCCGACC GCGTCCGCGG CGGCGTGCAG GAAATCGTCA CCCTCGGCCT GAACTGGTTC
CCCAATTCGG TGACCAAGTT CTCGCTGGAC TATCTGGACG TCGATGTGGA TCGCCGCGAT
ACGGCGGGCG CGCTGATCGG CCAGAGCTAC AAGGCCGTCA ACCTGCGCAG CCAGTACGCG
TTCTAA
 
Protein sequence
MTKAQTALRT ALIAGASLIA MTGAAHAQTV QTPEQQQARI EALEAQLEAL SSQIADLKAA 
TAASLKDVRT AQATTTVSIA GGKPSIASGD GAFSANIHAV MQLDATWYNQ DSGLPAAVTA
RDLNSGTNFR RARLGVDGKA FKNFDYGVLL DFGGSGTDGA GALQEIYLQY NYAPFKVKVG
AFAPNLGLED AASTNGSLFP ERPAAAEAAR GLAGADRRIS LQAQAVGERW IVSGAVTGAK
AGDGATFDEQ LGYVGRVAFI PFRGYDWLTH VGVNASRIAQ PAQTAVAGAY PITVEDRPEL
RTDGTRLVSS GAIDSAGARH YGFELAAQKK NFLIQGEYFD IALDRRNRAA NVTDPKFSGW
YVEGGWVLTG EQRKYNAANF AFDAPAIANP FDPRAGKWGA WELAARYSVL DVNHHEYATV
AADRVRGGVQ EIVTLGLNWF PNSVTKFSLD YLDVDVDRRD TAGALIGQSY KAVNLRSQYA
F