Gene SbBS512_E2740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2740 
Symbol 
ID6269789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2542720 
End bp2543865 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content45% 
IMG OID641726701 
Producthypothetical protein 
Protein accessionYP_001881180 
Protein GI187732857 
COG category[C] Energy production and conversion 
COG ID[COG1804] Predicted acyl-CoA transferases/carnitine dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.382378 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAATA ATGAAAGCAA AGGGCCGTTT GAAGGCTTAT TAGTTATCGA TATGACACAT 
GTCCTTAATG GACCTTTCGG AACTCAACTT CTTTGTAATA TGGGCGCAAG GGTAATTAAA
GTTGAGCCGC CGGGTCATGG TGATGATACC CGCACATTTG GTCCCTATGT GGATGGACAG
TCACTCTATT ACAGTTTTAT TAATCATGGC AAAGAGAGTG TGGTTCTTGA TTTAAAGAAT
GATTACGATA AAAGTATATT TATAAATATG CTCAAACAAG CTGATGTATT AGCTGAGAAT
TTTCGCCCAG GTACAATGGA AAAACTGGGG TTTTCATGGG AAACGCTTCA AGAAATCAAC
CCGAGCCTCA TATATGCTTC ATCGTCAGGT TTCGGACATA CCGGTCCGCT AAAAGATGCT
CCTGCCTACG ATACCATCAT TCAGGCAATG AGCGGGATAA TGATGGAAAC AGGATATCCT
GATGCTCCGC CAGTGCGCGT TGGTACATCT CTTGCGGATC TATGCGGCGG TGTCTATTTA
TTCAGCGGAA TAGTGAGTGC ACTTTATGGC CGCGAAAAGA GCCAGAGAGG GGCGCATGTC
GATATAGCGA TGTTTGATGC CACGCTGAGT TTTCTGGAGC ATGGTCTGAT GGCATATATC
GCGACAGGGA AGTCACCACA ACGTCTGGGA AATCGCCATC CCTACATGGC ACCTTTTGAT
GTTTTCAATA CTCAGGATAA GCCGATTACG ATTTGTTGTG GTAATGACAA GCTTTTTTCT
GCGTTATGCC AGGCACTGGA GCTTACGGAA CTGGTTAATG ATCCCCGATT TAGCAGCAAT
ATTTTACGCG TACAAAACCA GGCTATTCTT AAACAATATA TTGAGCGGAC GTTAAAAACG
CAGGCAGCTG AAGTTTGGTT AGCCAGAATA CATGAAGTTG GTGTACCCGT CGCGCCGTTA
TTAAGTGTGG CTGAGGCCAT TAAATTGCCA CAAACTCAGG CGAGAAATAT GTTGATTGAA
GCCGGGGGAA TAATGATGCC GGGTAATCCG ATAAAAATCA GCGGCTGCGC GGACCCGCAT
GTTATGCCGG GAGCGGCAAC GCTCGACCAG CATGGGGAAC AAATTCGCCA GGAGTTCTCA
TCATAA
 
Protein sequence
MTNNESKGPF EGLLVIDMTH VLNGPFGTQL LCNMGARVIK VEPPGHGDDT RTFGPYVDGQ 
SLYYSFINHG KESVVLDLKN DYDKSIFINM LKQADVLAEN FRPGTMEKLG FSWETLQEIN
PSLIYASSSG FGHTGPLKDA PAYDTIIQAM SGIMMETGYP DAPPVRVGTS LADLCGGVYL
FSGIVSALYG REKSQRGAHV DIAMFDATLS FLEHGLMAYI ATGKSPQRLG NRHPYMAPFD
VFNTQDKPIT ICCGNDKLFS ALCQALELTE LVNDPRFSSN ILRVQNQAIL KQYIERTLKT
QAAEVWLARI HEVGVPVAPL LSVAEAIKLP QTQARNMLIE AGGIMMPGNP IKISGCADPH
VMPGAATLDQ HGEQIRQEFS S