Gene Sbal223_4072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_4072 
Symbol 
ID7089690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp4832725 
End bp4833684 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content46% 
IMG OID643462950 
Productbiotin--protein ligase 
Protein accessionYP_002359968 
Protein GI217975217 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0340] Biotin-(acetyl-CoA carboxylase) ligase 
TIGRFAM ID[TIGR00121] birA, biotin-[acetyl-CoA-carboxylase] ligase region
[TIGR00122] BirA biotin operon repressor domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.228403 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATA ATTGGGGAAG AAAGCGCCAG ATATTAGCGT TATTGTCTAG CGGTCAATTT 
ATTTCAGGCG AGCAGTTAGC CACAGAACTT GGGATTTCGC GCGCCGCTGT AAATAAGCAT
ATTGATGCGT TAGAAACCTA TGGTGTGGCA ATTTATAGCG TTAAAGGTCG CGGCTATAAG
CTAGCCAATC CCATCTCTTT GATTGATGCT TCACGTTTAG TGCAGTCAAT TGATAACCGT
TGTTTTTATT TTGATGAGAT CGCAAGTACC AACGGCTTTA TGCTGAGTCA TACCACTGAG
CTAAAAAATG GCGATGTGTG CGTGGCAGAG TACCAATCTG CAGGTCGCGG TCGCCGAGGT
CGCACTTGGG TGTCGCCCTA TGGGCATCAC TTGTACTTCT CATTGTTTTG GACATTCCCG
CAGGGAATGG CACAGGCCAT GGGTCTAAGT TTAGTGGTGG CGTGCACTCT AGTTGAAGTG
CTTAAATCGT TTGGGGTCGA GAATATTGGG GTTAAGTGGC CGAATGATAT CTATTTGGAT
AACAAGAAGC TTGCCGGGAT CTTGATTGAA ATGTCGGGAC AGGTGGATAG TCAGTGTCAG
CTGATCATTG GTGTTGGCGT TAATATGGCG ATGTCAGATG AGCAAGGCAA AGGTATCGAT
CAGCCTTGGA GTGACCTGTC AGAGTTGGTC GATATGCCAG ATAAGACCGC GCTTGTCATT
GAATTACAGA AGCAGCTAAA GCGTGATATC CAGCTATTTG AACGTGAAGG ATTAGCTGCA
TTCAAGGCTC GTTGGCAAGC AGCGGATCTA TTTTTTGGAC GTGAAATTCG GTTATTAATG
GCTGATAACT TTGTGGATGG TATTTGTCGT GGTGTTGATG AGCAGGGGGC GGTATTGCTC
GAAACCGCCG ACGGTATGCA AGCATTTATC GGCGGTGAAA TTAGCTTAAG AGCGCGCTAA
 
Protein sequence
MSDNWGRKRQ ILALLSSGQF ISGEQLATEL GISRAAVNKH IDALETYGVA IYSVKGRGYK 
LANPISLIDA SRLVQSIDNR CFYFDEIAST NGFMLSHTTE LKNGDVCVAE YQSAGRGRRG
RTWVSPYGHH LYFSLFWTFP QGMAQAMGLS LVVACTLVEV LKSFGVENIG VKWPNDIYLD
NKKLAGILIE MSGQVDSQCQ LIIGVGVNMA MSDEQGKGID QPWSDLSELV DMPDKTALVI
ELQKQLKRDI QLFEREGLAA FKARWQAADL FFGREIRLLM ADNFVDGICR GVDEQGAVLL
ETADGMQAFI GGEISLRAR