Gene B21_03491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03491 
Symbolade 
ID8112620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3730409 
End bp3732175 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content55% 
IMG OID644849662 
Producthypothetical protein 
Protein accessionYP_003001235 
Protein GI251786931 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1001] Adenine deaminase 
TIGRFAM ID[TIGR01178] adenine deaminase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAATT CTATTAACCA TAAATTTCAT CACATTAGCC GGGCTGAATA CCAGGAATTG 
TTAGCCGTTT CCCGTGGCGA CGCTGTTGCC GATTATATTA TTGATAATGT CTCTATTCTC
GACCTGATCA ATGGCGGAGA AATTTCCGGC CCAATTGTGA TTAAAGGACG TTACATTGCC
GGTGTTGGCG CAGAATACAC TGATGCTCCG GCTTTGCAGC GGATTGATGC TCGCGGCGCA
ACGGCGGTGC CAGGGTTTAT TGATGCTCAC CTGCATATTG AATCCAGCAT GATGACGCCG
GTCACTTTTG AAACCGCTAC CCTGCCGCGC GGCCTGACGA CCGTTATTTG CGACCCTCAT
GAAATCGTCA ACGTGATGGG CGAAGCCGGA TTCGCCTGGT TTGCCCGCTG TGCCGAACAG
GCAAGGCAAA ACCAGTACTT ACAGGTCAGC TCTTGCGTAC CCGCCCTGGA AGGCTGCGAT
GTTAACGGTG CCAGTTTTAC CCTTGAACAG ATGCTCGCCT GGCGGGACCA TCCGCAGGTT
ACCGGCCTTG CAGAAATGAT GGACTACCCT GGCGTAATTA GCGGGCAGAA TGCGCTGCTC
GATAAACTGG ATGCATTTCG CCACCTGACG CTGGACGGTC ACTGCCCGGG TTTGGGTGGT
AAAGAACTTA ACGCCTATAT TACTGCGGGT ATTGAAAACT GCCACGAAAG TTATCAGCTG
GAAGAAGGAC GCCGGAAATT ACAACTCGGC ATGTCGTTGA TGATCCGCGA AGGGTCCGCT
GCCCGCAATC TCAACGCGCT GGCACCGTTG ATCAACGAAT TTAACAGCCC GCAATGCATG
CTCTGTACCG ATGACCGTAA CCCGTGGGAG ATCGCCCATG AAGGACACAT CGATGCCTTA
ATTCGCCGCC TGATCGAACA ACACAATGTG CCGCTGCATG TGGCATATCG CGTCGCCAGC
TGGTCGACGG CGCGCCACTT TGGTCTGAAT CACCTCGGCT TACTGGCACC CGGCAAGCAG
GCCGATATCG TCCTGTTGAG CGATGCGCGT AAGGTCACGG TGCAGCAGGT ACTGGTGAAA
GGCGAGCCGA TTGATGCGCA AACCTTACAG GCGGAAGAGT CGGCGAGACT GGCACAATCC
GCTCCGCCAT ATGGCAACAC CATTGCCCGC CAGCCAGTTT CCGCCAGCGA CTTTGCCCTG
CAATTTACGC CCGGAAAACG CTATCGGGTC ATTGACGTCA TCCATAACGA ATTGATTACG
CACTCCCACT CCAGCGTCTA CAGCGAAAAT GGTTTTGATC GCGATGATGT GAGCTTTATT
GCCGTACTTG AGCGTTACGG GCAACGGCTG GCTCCGGCTT GTGGTTTGCT TGGCGGCTTT
GGACTGAATG AAGGTGCGCT GGCTGCGACG GTCAGCCATG ACAGCCATAA TATTGTGGTG
ATCGGTCGCA GTGCCGAAGA GATGGCGCTG GCGGTCAATC AGGTGATTCA GGATGGCGGC
GGGCTGTGCG TGGTACGTAA CGGCCAGGTA CAAAGTCATC TGCCGTTACC CATTGCCGGG
CTGATGAGCA CCGACACGGC GCAGTCGCTG GCGGAACAAA TTGACGCCTT GAAAGCCGCC
GCCCGTGAAT GCGGTCCGTT ACCCGATGAG CCGTTTATTC AGATGGCGTT TCTTTCTCTG
CCAGTGATCC CCGCGCTAAA ACTAACCAGT CAGGGGCTAT TTGATGGCGA GAAGTTTGCC
TTCACTACGC TGGAAGTCAC GGAATAA
 
Protein sequence
MNNSINHKFH HISRAEYQEL LAVSRGDAVA DYIIDNVSIL DLINGGEISG PIVIKGRYIA 
GVGAEYTDAP ALQRIDARGA TAVPGFIDAH LHIESSMMTP VTFETATLPR GLTTVICDPH
EIVNVMGEAG FAWFARCAEQ ARQNQYLQVS SCVPALEGCD VNGASFTLEQ MLAWRDHPQV
TGLAEMMDYP GVISGQNALL DKLDAFRHLT LDGHCPGLGG KELNAYITAG IENCHESYQL
EEGRRKLQLG MSLMIREGSA ARNLNALAPL INEFNSPQCM LCTDDRNPWE IAHEGHIDAL
IRRLIEQHNV PLHVAYRVAS WSTARHFGLN HLGLLAPGKQ ADIVLLSDAR KVTVQQVLVK
GEPIDAQTLQ AEESARLAQS APPYGNTIAR QPVSASDFAL QFTPGKRYRV IDVIHNELIT
HSHSSVYSEN GFDRDDVSFI AVLERYGQRL APACGLLGGF GLNEGALAAT VSHDSHNIVV
IGRSAEEMAL AVNQVIQDGG GLCVVRNGQV QSHLPLPIAG LMSTDTAQSL AEQIDALKAA
ARECGPLPDE PFIQMAFLSL PVIPALKLTS QGLFDGEKFA FTTLEVTE