Gene Sfri_2201 details


Gene Information

Locus tag: Sfri_2201
Symbol:
ID: 4279113
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella frigidimarina NCIMB 400
Kingdom: Bacteria
Replicon accession: NC_008345
Strand:
Start bp: 2630679
End bp: 2632679
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 46%
IMG OID: 638134996
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_750885
Protein GI: 114563372
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
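
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Sketch: relate the coordinates in the record to the reported lengths.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS ends with exactly one stop codon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length_bp(2630679, 2632679)
print(length_bp)                     # 2001
print(protein_length_aa(length_bp))  # 666
```

Both values agree with the Gene Length and Protein Length fields of the record.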


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAATC CAAACCGTCG TCAAACTCGA GCTAACGCTC AACAATTTAT TGATACCCTT 
AAACCGTTAC AACATCCTAA CTCGGAAAAA GTCTATTTAG CGGGTAGTCG TGCTGATATC
AACGTTGCTA TGCGTCAAGT TAATCAAACT GATACCGTTA TTGGTGGTAC CGACGCTGAA
CCCATTAAAC AAGCCAATCC GCCGATCATG GTTTATGACT GCGCGGGTGC TTATTCTGAT
CCCCTTGCTG ATATCAATGT GCGTGAAGGC TTACCTAAAA TCCGTCAAGC ATGGATTGAT
GAGCGACAAG ATACTGAGCA ATTAACGGGT GCAAGTTCGG GTTTTACCCA ACAACGACTG
GCCGACGATG GACTCGATCA TTTACGATTT GACGCCTTAG TCGCGCCTAA ACGTGCCAAG
AAGGGCAAAT GCGTGACCCA AATGTATTAT GCACGCCAAG GTATTATTAC CCCTGAAATG
GAATACATTG CCATTCGTGA AAACATGGCT CGCGCTGAAG TAACCGATAG CGTGTTGACT
CAAAAAGCCC CAGGTGAGAA TTTTGGTGCA CATATTGGGC AACTTATCAC TCCTGAGTTT
GTCCGCGATG AAGTCGCCCG AGGTCGAGCA ATTATTCCGC TGAATATTAA CCATCCTGAA
ACTGAACCTA TGATTATTGG GCGTAATTTT TTGGTGAAGG TGAATGCCAA TATTGGTAAT
TCGGCGGTGA CATCTTCGAT TGAAGAAGAA GTTGAAAAAC TCGTGTGGTC TACCCGTTGG
GGCGCCGATA CGGTCATGGA TTTGTCGACT GGACGTTATA TTCATGAAAC GCGTGAGTGG
CTTATCCGTA ATTCGCCTGT GCCGATTGGT ACTGTGCCGA TTTATCAAGC ATTAGAAAAG
GTTAATGGCG TTGCCGAAGA CCTCAGTTGG GCGGTATTTA GAGATACATT GATTGAACAA
GCCGAGCAGG GCGTAGACTA TTTTACGATT CATGCAGGGG TGTTATTGCG TTATGTACCC
ATGACCGCCA AACGCCTTAC CGGCATTGTG TCGCGTGGCG GTTCAATTAT GGCTAAGTGG
TGTTTAAGTC ATCATCAAGA AAGTTTCTTA TATGAACATT TTCGCGATAT TTGCCAAATT
TGCGCCCAAT ACGATGTGTC GTTGTCGTTA GGGGATGGTA TGCGCCCCGG TTCAATTGCC
GACGCCAATG ATGAAGCGCA ATTTGCGGAA TTAGAAACCT TAGGTGAATT GGTCAAAATT
GCATGGGAAT ACGATGTGCA GACCATCATC GAAGGCCCAG GTCATGTGCC AATGCAATTG
ATTAAAGTCA ATATGGAAAA GCAATTGGCC CTTTGTGATG AAGCGCCATT TTATACATTA
GGCCCACAAA CAACCGATAT TGCGCCTGGC TATGATCACT TCACCTCAGG TATTGGTGCT
GCCATGATGG CTTGGTACGG CGTGGCGATG TTGTGTTACG TCACCCCTAA AGAACACTTA
GGTTTGCCTA ACAAAGAAGA TGTAAAACAA GGCTTAATTG CCTACAAGAT TGCTGCCCAT
GCTGCGGATG TCGCTAAAGG TCATCCAGGT GCGCAAGTAC GTGATAATGC ATTATCAAAA
GCCCGCTTTG AATTCCGTTG GGAAGATCAA TATAACCTAG GTCTTGATCC TGATACCGCT
CGTGCTTACC ACGACGAATC GTTACCACAA GAATCAGCCA AAGTGGCTCA TTTTTGTTCA
ATGTGTGGGC CTAAATTCTG TTCGATGAAA ATAACCCACG AAGTACGTGA ATATGCCGCG
AATCTTGAAC AAGCTGAAGC ACTTAAGATT GAAGTATCTT CTGCTAGTGC TTATCAAGCT
CTACATACGG CTCAAGCCAT AAATCCGTCC GAAGCGATGG CGCAAAAGTC GGCAGAGTTC
AAAGCGTCGG GCTCGGCGCT TTATCACGAC GTCAAACCCG CAGCACAAGT TAATACTGAC
TTAACGGTAG AGGTTGAATA A
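
A GC content figure like the one in the record can be computed from a sequence printed in this layout. The sketch below assumes GC content means the fraction of G and C bases over all bases, and that the spaces and line breaks in the listing are only formatting. The function name `gc_fraction` is hypothetical. The demonstration uses only the first ten-base block of the gene sequence, so its result is not the whole-gene value.

```python
# Sketch: GC fraction of a DNA string, ignoring the whitespace used
# to group bases into blocks of ten in the listing.

def gc_fraction(sequence: str) -> float:
    """Return the fraction of bases that are G or C."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


first_block = "ATGTCAAATC"  # first block of the gene sequence
print(f"{gc_fraction(first_block):.0%}")  # 30%
```

Passing the full gene sequence to the same function would give the figure to compare with the GC content field.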
 
Protein sequence
MSNPNRRQTR ANAQQFIDTL KPLQHPNSEK VYLAGSRADI NVAMRQVNQT DTVIGGTDAE 
PIKQANPPIM VYDCAGAYSD PLADINVREG LPKIRQAWID ERQDTEQLTG ASSGFTQQRL
ADDGLDHLRF DALVAPKRAK KGKCVTQMYY ARQGIITPEM EYIAIRENMA RAEVTDSVLT
QKAPGENFGA HIGQLITPEF VRDEVARGRA IIPLNINHPE TEPMIIGRNF LVKVNANIGN
SAVTSSIEEE VEKLVWSTRW GADTVMDLST GRYIHETREW LIRNSPVPIG TVPIYQALEK
VNGVAEDLSW AVFRDTLIEQ AEQGVDYFTI HAGVLLRYVP MTAKRLTGIV SRGGSIMAKW
CLSHHQESFL YEHFRDICQI CAQYDVSLSL GDGMRPGSIA DANDEAQFAE LETLGELVKI
AWEYDVQTII EGPGHVPMQL IKVNMEKQLA LCDEAPFYTL GPQTTDIAPG YDHFTSGIGA
AMMAWYGVAM LCYVTPKEHL GLPNKEDVKQ GLIAYKIAAH AADVAKGHPG AQVRDNALSK
ARFEFRWEDQ YNLGLDPDTA RAYHDESLPQ ESAKVAHFCS MCGPKFCSMK ITHEVREYAA
NLEQAEALKI EVSSASAYQA LHTAQAINPS EAMAQKSAEF KASGSALYHD VKPAAQVNTD
LTVEVE
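
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a minimal sketch, the code below builds a codon table and translates the first three blocks of the gene sequence. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which start codons it permits; this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Sketch: translate DNA using the standard codon assignments, which
# translation table 11 (bacterial) also uses for amino acids.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Codons are enumerated in TCAG order at each position; "*" marks a stop.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGTCAAATC CAAACCGTCG TCAAACTCGA"))  # MSNPNRRQTR
```

The output matches the first block of the protein sequence, MSNPNRRQTR.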