Gene B21_01481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01481 
SymbolydeU 
ID8116332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp1550173 
End bp1551573 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content53% 
IMG OID644847717 
Producthypothetical protein 
Protein accessionYP_002999290 
Protein GI251784986 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGCG AAGGGGGGAA ACCGGGGAAT GTACTGACCG TTAACGGCAA CTATACCGGA 
AACAATGGCC TGATGACGTT CAACGCGACG CTGGGCGGCG ATAATTCGCC CACCGATAAG
ATGAACGTGA AAGGCGATAC CCAAGGGAAC ACTCGCGTTC GGGTTGATAA CATTGGCGGC
GTCGGTGCAC AAACGGTCAA CGGTATTGAA CTCATTGAGG TTGGCGGTAA TTCTGCAGGT
AACTTCGCGC TGACCACCGG AACTGTCGAA GCTGGGGCTT ACGTCTACAC GCTGGCTAAA
GGGAAGGGGA ATGACGAGAA AAACTGGTAT CTGACCAGTA AATGGGACGG CGTAACGCCA
GCGGATACAC CCGATCCCAT CAATAATCCC CCTGTTGTGG ATCCGGAAGG CCCATCAGTT
TATCGCCCGG AGGCCGGAAG CTATATCAGC AACATTGCCG CAGCCAACTC GCTGTTTAGC
CATCGTTTAC ACGACCGTCT AGGTGAGCCG CAGTATACAG ATTCACTGCA TTCTCAGGGG
TCGGCAAGCA GTATGTGGAT GCGTCATGTC GGAGGGCACG AACGTTCAAG GGCCGGTGAC
GGTCAGCTAA ATACTCAGGC TAACCGCTAT GTATTGCAGC TAGGCGGCGA TTTGGCGCAG
TGGAGTAGCA ACGCGCAGGA TCGCTGGCAT CTTGGCGTGA TGGCAGGCTA CGCCAATCAG
CACAGTAATA CTCAGAGTAA TCGTGTGGGT TATAAATCGG ATGGGCGCAT CAGCGGTTAC
AGCGCTGGGC TGTACGCGAC CTGGTATCAG AACGATGCGA ATAAGACCGG CGCTTATGTT
GACAGCTGGG CGCTGTATAA CTGGTTTGAT AACAGCGTCA GTTCCGATAA CCGTTCTGCT
GACGACTATG ATTCTCGCGG TGTGACGGCC TCTGTTGAGG GTGGGTATAC CTTTGAAGCG
GGAACATTTA GCGGCAGCGA AGGGACGCTG AATACCTGGT ACGTCCAGCC ACAGGCGCAA
ATCACCTGGA TGGGTGTGAA AGATTCCGAC CATACCCGGA AAGACGGAAC GCGCATTGAA
ACGGAAGGCG ACGGAAATGT GCAAACGCGA CTTGGGGTGA AAACCTACCT GAATAGCCAT
CACCAGCGTG ACGATGGTAA ACAGCGTGAG TTCCAGCCTT ACATTGAAGC GAACTGGATC
AACAATAGCA AAGTCTACGC CGTGAAGATG AATGGTCAAA CCGTAGGCCG TGAAGGTGCG
CGTAATCTCG GTGAAGTACG TACCGGGGTT GAGGCGAAAG TAAATAACAA CCTTAGCCTG
TGGGGGAATG TCGGTGTGCA ACTAGGTGAT AAAGGCTATA GCGATACTCA GGGCATGCTG
GGAGTGAAAT ATAGCTGGTA A
 
Protein sequence
MNSEGGKPGN VLTVNGNYTG NNGLMTFNAT LGGDNSPTDK MNVKGDTQGN TRVRVDNIGG 
VGAQTVNGIE LIEVGGNSAG NFALTTGTVE AGAYVYTLAK GKGNDEKNWY LTSKWDGVTP
ADTPDPINNP PVVDPEGPSV YRPEAGSYIS NIAAANSLFS HRLHDRLGEP QYTDSLHSQG
SASSMWMRHV GGHERSRAGD GQLNTQANRY VLQLGGDLAQ WSSNAQDRWH LGVMAGYANQ
HSNTQSNRVG YKSDGRISGY SAGLYATWYQ NDANKTGAYV DSWALYNWFD NSVSSDNRSA
DDYDSRGVTA SVEGGYTFEA GTFSGSEGTL NTWYVQPQAQ ITWMGVKDSD HTRKDGTRIE
TEGDGNVQTR LGVKTYLNSH HQRDDGKQRE FQPYIEANWI NNSKVYAVKM NGQTVGREGA
RNLGEVRTGV EAKVNNNLSL WGNVGVQLGD KGYSDTQGML GVKYSW