Gene EcE24377A_E0022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_E0022 
SymbolsopA 
ID5585882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009790 
Strand
Start bp20998 
End bp22164 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content47% 
IMG OID640913913 
Productplasmid-partitioning protein SopA 
Protein accessionYP_001451563 
Protein GI157149534 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG1192] ATPases involved in chromosome partitioning 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.19504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTCA TGGAAACACT TAACCAGTGC ATAAACGCTG GTCATGAAAT GACGAAGGCT 
ATCGCCATTG CACAGTTTAA TGATGACAGC CCGGAAGCGA GGAAAATAAC CCGGCGCTGG
AGAATAGGTG AAGCAGCGGA TTTAGTTGGG GTTTCTTCTC AGGCTATCAG AGATGCCGAG
AAAGCAGGGC GACTACCGCA CCCGGATATG GAAATTCGAG GACGGGTTGA GCAACGTGTT
GGTTATACAA TTGAACAAAT TAATCATATG CGTGATGTGT TTGGTACGCG ATTGCGACGT
GCTGAAGACG TATTTCCACC GGTGATCGGG GTTGCTGCCC ATAAAGGTGG CGTTTACAAA
ACCTCAGTTT CTGTTCATCT TGCTCAGGAT CTGGCTCTGA AGGGGCTACG TGTTTTGCTC
GTGGAAGGTA ACGACCCCCA GGGAACAGCC TCAATGTATC ACGGATGGGT ACCAGATCTT
CATATTCATG CAGAAGACAC TCTCCTGCCT TTCTATCTTG GGGAAAAGGA CGATGTCACT
TATGCAATAA AGCCCACTTG CTGGCCGGGG CTTGACATTA TTCCTTCCTG TCTGGCTCTG
CACCGTATTG AAACTGAGTT AATGGGCAAA TTTGATGAAG GTAAACTGCC CACCGATCCA
CACCTGATGC TCCGACTGGC CATTGAAACT GTTGCTCATG ACTATGATGT CATAGTTATT
GACAGCGCGC CTAACCTGGG TATCGGCACG ATTAATGTCG TATGTGCTGC TGATGTGCTG
ATTGTTCCCA CGCCTGCTGA GTTGTTTGAC TACACCTCCG CACTGCAGTT TTTCGATATG
CTTCGTGATC TGCTCAAGAA CGTTGATCTT AAAGGGTTCG AGCCTGATGT ACGTATTTTG
CTTACCAAAT ACAGCAATAG TAATGGCTCT CAGTCCCCGT GGATGGAGGA GCAAATTCGG
GATGCCTGGG GAAGCATGGT TCTAAAAAAT GTTGTACGTG AAACGGATGA AGTTGGTAAA
GGTCAGATCC GGATGAGAAC TGTTTTTGAA CAGGCCATTG ATCAACGCTC TTCAACTGGT
GCCTGGAGAA ATGCTCTTTC TATTTGGGAA CCTGTCTGCA ATGAAATTTT CGATCGTCTG
ATTAAACCAC GCTGGGAGAT TAGATAA
 
Protein sequence
MKLMETLNQC INAGHEMTKA IAIAQFNDDS PEARKITRRW RIGEAADLVG VSSQAIRDAE 
KAGRLPHPDM EIRGRVEQRV GYTIEQINHM RDVFGTRLRR AEDVFPPVIG VAAHKGGVYK
TSVSVHLAQD LALKGLRVLL VEGNDPQGTA SMYHGWVPDL HIHAEDTLLP FYLGEKDDVT
YAIKPTCWPG LDIIPSCLAL HRIETELMGK FDEGKLPTDP HLMLRLAIET VAHDYDVIVI
DSAPNLGIGT INVVCAADVL IVPTPAELFD YTSALQFFDM LRDLLKNVDL KGFEPDVRIL
LTKYSNSNGS QSPWMEEQIR DAWGSMVLKN VVRETDEVGK GQIRMRTVFE QAIDQRSSTG
AWRNALSIWE PVCNEIFDRL IKPRWEIR