Gene B21_03231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03231 
Symbolybl143 
ID8116242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3423844 
End bp3424836 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content47% 
IMG OID644849408 
Producthypothetical protein 
Protein accessionYP_003000981 
Protein GI251786677 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.862274 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAT TCACCGGTGT TTTACTATTA GGCACGGCGT TACTGGCGGG ATGTGTCGAC 
CGGGAAGGGT ACTATAACAG CGTTAGGGAA GAAGAGAGCC ATGGACTGAC GTCTCTGCGG
GGGCAACCTG CATTACGTTA CAGCGATGAT TGGTCAAGAT GGCCGAGAGT GTACGGCGCT
ACAGCCTTAT ACCCGCTGTA TGCCTCCGCG TATTATAAAT TAGTACCCGA GCCAAAAGAT
AAGGATCGAA CCTCGCTGGC CTGGCAGGCG TATGGTTTGC AGCAAACCCG AACAGCTGAA
GCCTACGATA GTCTGATTAA AGGTTCCGCG ACGGTTATTT TTGTTGCACA ACCGTCGGAA
GGACAGAAAA AACGTGCAGA AGAAGCGGGT GTTAAACTGA AATATACCGC TTTCGCCCGC
GAAGCCTTTG TCTTTATCGT TGATATTAAT AACCCGGTAA ATTCTCTCTC TGAGCACCAG
GTTAAAGATA TTTTTAGCGG CAAAACTAGC CGCTGGAATA AAGTAGGTGG TAGTGACGAA
CATATAAAAG TCTGGCAGCG CCCTGAAGAT TCTGGAAGCC AAACGATTAT GAAGGGGTTG
GTTATGCAAG ACACCCCAAT GCTGCCAGCT AAAAAATCCA CTGTGATTGA TCTTATGGGC
GGTTTAATTA CTGAAGTTGC CGACTATCAA AACACGCCAT CTTCCATTGG GTACACCTTC
CACTATTACG TCACTCGTAT GAATGACAAT ATGCTCAAAA TGCGCAAGCA GATTAAACTT
TTGGCTATAA ATGGCGTTGC GCCTACCGAG GAAAATATCC GCAACGGCAC TTATCCATAC
ATTGTGGATG CCTATATGGT GACGCGTGAA AATCCCACGC CGGAAACGCA GAAATTTGTT
GACTGGTTTA TAAGTCAGCA GGGGCAACAG TTGGTAGAGG ATGTGGGGTA TGTGCCGCTG
TATGAAGCAT CCCCCGAATC ATCAGGACAA TAA
 
Protein sequence
MNKFTGVLLL GTALLAGCVD REGYYNSVRE EESHGLTSLR GQPALRYSDD WSRWPRVYGA 
TALYPLYASA YYKLVPEPKD KDRTSLAWQA YGLQQTRTAE AYDSLIKGSA TVIFVAQPSE
GQKKRAEEAG VKLKYTAFAR EAFVFIVDIN NPVNSLSEHQ VKDIFSGKTS RWNKVGGSDE
HIKVWQRPED SGSQTIMKGL VMQDTPMLPA KKSTVIDLMG GLITEVADYQ NTPSSIGYTF
HYYVTRMNDN MLKMRKQIKL LAINGVAPTE ENIRNGTYPY IVDAYMVTRE NPTPETQKFV
DWFISQQGQQ LVEDVGYVPL YEASPESSGQ