Gene B21_03232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03232 
Symbolybl144 
ID8116243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3424862 
End bp3426367 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content48% 
IMG OID644849409 
Producthypothetical protein 
Protein accessionYP_003000982 
Protein GI251786678 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAACA GAAAATGGAT TTTGACCTCG CTGGTAATGA CTTTTTTCGG CATCCCCATT 
CTGGCGCAGT TTTTGGCGGC GGTTATTGCC ATGCTGGGTG TCGGACTTGC CGGTATTATT
GAAGTTTGTA ATATCTTTAT CACGCCAACA ATTTACCTTC TGCTCAACAT TTTTATGCTG
GCGCTGGGCG CATTAATGCT ATTTTTCTCG GGGCGAGTGT GGGCGGACGA TAGTGCACCA
GAAAAAAGAG AAATAGCCGT CTGGCGACAA TGTCTTTTTT TAGTACCCGC ATTATTAACC
CTGGGGGTCT GGATAATCGC GCTGCATCTG GCAGATTATC AATTTCGCCA GATGGGAGCG
GGTTGGTTGG CTGATCTTAT GCTCCCCTGG CTGGGCGTTT TGTTAGCCTC ATTAGTCGGT
GGTGAGTACT GGTGGTTAGT CATTATACCT GTTGGCGCGC ATATCAGTTT TTCGCTGGGG
TACGGCTGGC CGACCAGATA TCCTTTAACG GGCACGTCCG GGTTACGTTG CCGTAATTCT
CTCTTGTTTA TCCTTCTCAT GCTTGGTTTT GTCGCCGGTT ACCAGGCTTA TTTATATAAA
CAGCTTAATC CCGGCGTCGG TGTGCGTGAA AATATTGATA CCTGGGCCTG GCGACCCGAT
AAACTCAATA ATCAACTGAC ACCACTGCGT GGTAAACCGC AAATTCAGTT CACGCAAAAC
TGGCCGCGAC TTGATGGCGC AACGGCGGCG TACCCCATTT ATGCCTCTGC CTTTTATGCA
CTAAGCGTTT TGCCGGAAGA TTTTCACGAA TGGGAATATC TGGCGAACTC TCGTACTCCC
GAAGCATATA ACAAGATTGT TAAAGGTAAT GCCGATATTA TCTTTGTGGC TCAACCTTCC
GGTGGGCAGA AAAAACGCGC GGAGGAATCG GGCGTCACTT TGATTTACAC GCCTTTTGCC
CGTGAAGCGT TTGTTTTCAT CGTCAATGCA GATAACCCGG TTAATTCCCT GACCGAACAA
CAAGTGCGTG ACATCTTCAG TGGTGCAATT ACCAACTGGC GCACGGTTGG CGGTAACGAT
CAGGAGATCC AGACCTGGCA GCGCCCGGAA GACTCTGGCA GCCAGACAGT GATGCAATCA
CAGGTCATGA AAAATGTCCG CATGATCTCG CCGCAGGAAA CGAAAGTGGC AAGCGTGATG
GAGGGAATGA TTAAAGTCGT TGCCGAATAC CGTAATACAA ACAACGCAAT AGGCTATACC
TTCCGCTATT ACGCGACGCA AATGAATGCT GATAAAAATA TAAAATTGCT AGCGATTAAC
GGTATTACAC CGACGGCGGA AAACATTCGC AACGGCAAAT ATGCGTACAT CGTCGATGCA
TTTATGGTGA CGAGAGAAAA TACAACGTCA GAAACACAAA AACTGGTCGA ATGGTTTTTA
ACGCCGCAGG GGCAGAGTCT GGTAGAAGAT GTGGGATATG TGCCGCTGTA TCCAACAATG
GAATAA
 
Protein sequence
MQNRKWILTS LVMTFFGIPI LAQFLAAVIA MLGVGLAGII EVCNIFITPT IYLLLNIFML 
ALGALMLFFS GRVWADDSAP EKREIAVWRQ CLFLVPALLT LGVWIIALHL ADYQFRQMGA
GWLADLMLPW LGVLLASLVG GEYWWLVIIP VGAHISFSLG YGWPTRYPLT GTSGLRCRNS
LLFILLMLGF VAGYQAYLYK QLNPGVGVRE NIDTWAWRPD KLNNQLTPLR GKPQIQFTQN
WPRLDGATAA YPIYASAFYA LSVLPEDFHE WEYLANSRTP EAYNKIVKGN ADIIFVAQPS
GGQKKRAEES GVTLIYTPFA REAFVFIVNA DNPVNSLTEQ QVRDIFSGAI TNWRTVGGND
QEIQTWQRPE DSGSQTVMQS QVMKNVRMIS PQETKVASVM EGMIKVVAEY RNTNNAIGYT
FRYYATQMNA DKNIKLLAIN GITPTAENIR NGKYAYIVDA FMVTRENTTS ETQKLVEWFL
TPQGQSLVED VGYVPLYPTM E