Gene B21_02644 details


Gene Information

Locus tag: B21_02644
Symbol: ygeD
ID: 8115194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2810158
End bp: 2811351
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 56%
IMG OID: 644848841
Product: hypothetical protein
Protein accession: YP_003000414
Protein GI: 251786110
COG category:
COG ID:
TIGRFAM ID:
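
The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon while the protein length does not; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2810158, 2811351)
print(length)                  # 1194
print(protein_length(length))  # 397
```

Both values match the Gene Length and Protein Length fields reported for B21_02644.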


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0181673
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGAGT CAGTGCACAC TAACACTTCG TTGTGGTCGA AGGGGATGAA AGCGGTTATC 
GTGGCGCAGT TTCTCTCTGC GTTTGGCGAT AATGCCCTAC TGTTTGCCAC TCTGGCGTTA
CTGAAAGCGC AGTTCTATCC GGAGTGGAGC CAGCCCATCC TGCAAATGGT GTTTGTAGGT
GCTTACATTC TTTTTGCGCC GTTTGTCGGG CAGGTGGCGG ATAGCTTCGC CAAAGGCCGG
GTGATGATGT TTGCCAACGG CCTGAAGCTG CTGGGCGCAG CCAGTATCTG CTTTGGTATC
AATCCGTTTC TCGGCTATAC GCTGGTGGGT GTTGGTGCTG CAGCCTATTC ACCGGCGAAA
TACGGTATTC TCGGCGAATT AACCACGGGT AGTAAGTTAG TGAAAGCTAA CGGTTTAATG
GAAGCTTCTA CCATAGCGGC GATTTTGCTC GGTTCCGTAG CCGGTGGTGT GCTGGCTGAC
TGGCATGTCC TCGTCGCCCT GGCCGCATGC GCACTGGCCT ACGGTGGTGC GGTCGTTGCC
AATATCTACA TTCCCAAACT GGCGGCGGCG CGTCCGGGGC AGTCCTGGAA TCTCATCAAC
ATGACCCGCA GTTTCCTGAA TGCCTGCACC TCGCTATGGC GCAATGGTGA AACGCGTTTT
TCGCTGGTGG GCACCAGTTT ATTCTGGGGA GCGGGTGTCA CGCTGCGTTT CCTGTTGGTG
CTGTGGGTAC CGGTGGCGCT GGGCATTACC GATAACGCTA CGCCCACCTA TCTCAACGCG
ATGGTAGCGA TTGGTATCGT GGTTGGCGCA GGTGCGGCAG CGAAGTTAGT TACGCTGGAA
ACCGTGTCAC GCTGTATGCC AGCCGGGATT TTGATTGGCG TGGTGGTACT GATTTTTTCC
CTGCAACACG AGCTGCTGCC AGCCTATGCC TTGTTGATGC TGATTGGCGT GATGGGGGGC
TTTTTTGTCG TTCCGCTCAA TGCGTTGCTA CAGGAGCGGG GTAAAAAAAG CGTCGGGGCG
GGGAATGCGA TTGCAGTACA AAACCTTGGC GAAAACAGCG CCATGTTGTT GATGCTGGGC
ATTTACTCGC TGGCGGTAAT GATAGGCATC CCGGTCGTGC CCATTGGCAT TGGCTTCGGT
GCGCTGTTTG CGCTGGCAAT AACGGCGCTG TGGATCTGGC AGCGCCGTCA TTAA
 
Protein sequence
MSESVHTNTS LWSKGMKAVI VAQFLSAFGD NALLFATLAL LKAQFYPEWS QPILQMVFVG 
AYILFAPFVG QVADSFAKGR VMMFANGLKL LGAASICFGI NPFLGYTLVG VGAAAYSPAK
YGILGELTTG SKLVKANGLM EASTIAAILL GSVAGGVLAD WHVLVALAAC ALAYGGAVVA
NIYIPKLAAA RPGQSWNLIN MTRSFLNACT SLWRNGETRF SLVGTSLFWG AGVTLRFLLV
LWVPVALGIT DNATPTYLNA MVAIGIVVGA GAAAKLVTLE TVSRCMPAGI LIGVVVLIFS
LQHELLPAYA LLMLIGVMGG FFVVPLNALL QERGKKSVGA GNAIAVQNLG ENSAMLLMLG
IYSLAVMIGI PVVPIGIGFG ALFALAITAL WIWQRRH
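
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. This is a sketch under stated assumptions: it uses the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in its permitted start codons), it does not handle alternative start codons, and the helper names `clean`, `gc_content`, and `translate` are hypothetical. The two test fragments are the first three 10-base blocks and the last line of the gene sequence above.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon-table order: first base varies slowest (T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_blocks = "ATGAGTGAGT CAGTGCACAC TAACACTTCG"
last_line = "GCGCTGTTTG CGCTGGCAAT AACGGCGCTG TGGATCTGGC AGCGCCGTCA TTAA"

print(translate(first_blocks))  # MSESVHTNTS
print(translate(last_line))     # ALFALAITALWIWQRRH
```

The last line starts in frame because the 19 preceding lines hold 60 bases each (1140 bp, a multiple of 3). Passing the full gene sequence to `gc_content` should give a value comparable to the reported GC content field.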