Gene Sbal223_1401 details


Gene Information

Locus tag: Sbal223_1401
Symbol:
ID: 7088826
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 1644567
End bp: 1645796
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 47%
IMG OID: 643460304
Product: protein of unknown function DUF201
Protein accession: YP_002357331
Protein GI: 217972580
COG category: [I] Lipid transport and metabolism
COG ID: [COG0439] Biotin carboxylase
TIGRFAM ID:
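
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1644567, 1645796)
print(length, protein_length_aa(length))
```

With the values in this record, the sketch reproduces the listed gene length of 1230 bp and protein length of 409 aa.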


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAAAA ATCTTCTTAT TATTGGTGCA GGCATTGAAG CTTGTGAGGG GATAAAGGTT 
GCGAAAGGCA TGGGACTTGG GTTAATCATC GCCGATGGCA ACCCTGCAGC CCCAGGGCTA
GCCTTGGCCG ACTGGCAAAT TATTGAGAGC ACCTACGATG GCGAAGCCAT TCTAGACCAA
GTTAAAAAGC TTCAAGCAAA CGGAATACAT ATTGATGGTG CCATCGCTAT GTGTGCGGAT
GTGCCATTAA GTGTCGCCAC CGTGACGAAT GCACTCGGGC TACCGGGATT GTCAATGGAT
AGCGCATTCC TAGTATCTGA TAAACTGGCG ATGAAAATTA AACTCCAATC TGAAGGGATC
CCGATACCAC GGTTTGCAGA CGTATCAGAT AAATTAAAGT TAGTAGAACA AGCCACCGCT
ATCGGCTTTC CACTTATTAT TAAACCCGTA GATAGTCGTG GTGCACGTGG AGTTCAGCTT
ATTGAGACCC CCGAAACCCT AGACACGGCT TGGCAACTAG CGGCCAAAGA GTCCCCCACA
TCAAGAGTCA TGCTAGAAGA ATATCTTGAA GGGCCACAAT TTAGCACGGA AACCTTAGTT
GATAGAGGGC GATGCCATAC ACTGGGATTT GCCGACAGAA ACTACGAATG GCTAGCACGC
ACTAAGCCTT TTATTATTGA AAATGGTGGT GATGCACCGA CGATTGTGAG TACAGATATA
AATGCAGAAG TGATAGCCAC TGTTGAGAAA GCTGCCGCGG CACTGGGCAT CCATCAAGGC
ATCGCTAAAG GTGACATGGT TTACACTGCC GAGGGAGCGA AAGTGATTGA AATTGCAGGA
CGGCTCTCAG GAGGGTTCTT TTCAACCACA CAAATCCCAC TAGCAACTGG GGTTAATTTT
ATCGAAAAAG CAATTAAATT GGCACTGGGT GAACCTTTAA CTGATGATGA AGTTACTGCC
AAATATCAGC GGGCGGTTGC CATACGCTAC CTTGATCTTG CGCCAGGTAA AGTAAGCCGT
ATTCACGGGA TCTCAAAGGC AAGCGGGGCT TCTGGGATTG AAATGTTAAA AGTATTCGTT
GGGCCTGGAA GCAACATTTA CCCACTGACC AATCATACAC AAAGAGCGGG TTTTGCTATT
GCAAGTGCTG AAAAAAAACA AGACGCGATA GCAAGAGCAC TCGCAGCCTT ATCTCAGATA
ACAGTCGAAT ATGAGGATTT GCAGACATGA
 
Protein sequence
MTKNLLIIGA GIEACEGIKV AKGMGLGLII ADGNPAAPGL ALADWQIIES TYDGEAILDQ 
VKKLQANGIH IDGAIAMCAD VPLSVATVTN ALGLPGLSMD SAFLVSDKLA MKIKLQSEGI
PIPRFADVSD KLKLVEQATA IGFPLIIKPV DSRGARGVQL IETPETLDTA WQLAAKESPT
SRVMLEEYLE GPQFSTETLV DRGRCHTLGF ADRNYEWLAR TKPFIIENGG DAPTIVSTDI
NAEVIATVEK AAAALGIHQG IAKGDMVYTA EGAKVIEIAG RLSGGFFSTT QIPLATGVNF
IEKAIKLALG EPLTDDEVTA KYQRAVAIRY LDLAPGKVSR IHGISKASGA SGIEMLKVFV
GPGSNIYPLT NHTQRAGFAI ASAEKKQDAI ARALAALSQI TVEYEDLQT
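
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the sequence is pasted as text in the space-separated 10-base blocks shown here; the helper names are hypothetical. The amino acid assignments follow NCBI translation table 11, which uses the same codon-to-residue mapping as the standard code and differs only in its permitted start codons.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from this record.
print(translate("ATGACAAAAA ATCTTCTTAT TATTGGTGCA"))
```

Applied to the first three blocks, the sketch yields MTKNLLIIGA, the first block of the listed protein. Passing the full gene sequence to `translate` and `gc_percent` allows comparison with the Protein sequence and GC content fields.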