Gene Sbal195_2801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_2801 
Symbol 
ID5754579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp3334320 
End bp3335999 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content49% 
IMG OID641289113 
Productanthranilate synthase component I 
Protein accessionYP_001555228 
Protein GI160875912 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00565] anthranilate synthase component I, proteobacterial subset 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.790678 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAAC AGACATTCGC ACGCTCAAGC ACACTCAAGG CGGCATTAAC CTACCACAGC 
GATCCACTGC GCTTGTATCA GCACATTACC CAAGATGCGC CCCATACTAT GTTGTTGGAG
TCGGCCGAAA TCGACAGTAA AGATCACTTA AAAAGCATGG TGATGACCCA TGCCGCCATT
ATGATCCGCT GCGACGGTTA TCAACTGACC TTTACCGCAC TGACCGACAA TGGCGCGAGC
TTACTTACCC CAATCGAAAC CTTCTTCAGC GCGAGCGGCG ATCGTGCTGA TATGAGTGCC
AATCTAGTCC GTGATAACTT GACCTTAGTG GTGACACTGC AAAAAGACAC TAAGCTGCAG
GATGAAGATG CACGCTTAAA ATCTACTTCG CCACTCGATG GCCTACGGAT GTTTATCCAG
CAAATTGATT GTGGCACTCA TACTGACAGC CAAAGTAAAC CCGCCTTTGA GGATCTGTTT
TTAGGTGGCG TGTTGGCCTA CGACTTGATT GATACCGTCG AACCGCTGCC AGCCGTACCG
AACCGCGATA ATGATTGCCC AGACTACTTA TTTTACCTCG CTGAAACCTT AATCCTTATC
GACCATAAAC TGAAACAAGC CGACATCATT ACCCATAATT TCAGTCGTGA TTCCGCCCAG
CATACCGCCA TCACCGCAGC GCTGAGCGAG CGAGTTCAGC ATCTAAGCAC ACAATGTAAA
ACCCTCGGTA ATTCACCTGC CGATGTGCCG ACACTGGTCG CCATCGACGC TACTGAGCAA
GTCAATATTT CCGATGAGGT GTTCAAACAA ACCGTTATCG ATTTGAAAGA ACACATTATT
GCGGGCGATA TTTTCCAAGT GGTGCCATCG CGTAGTTTTA GTTTACCTTG CCCGAATACC
TTAGGGGCTT ATCGCGCCCT GCGTCTAACT AACCCAAGCC CTTACATGTT TTATTTCAGA
GGCCAAGATT TCACGCTTTT TGGTGCTTCA CCAGAAAGTG CGCTTAAGTA CGAGGCCAGC
AGCAATCAAG TCGAAGTCTA CCCGATTGCC GGCACCCGCA AACGCGGCAA AACCGCCACG
GGCGAGATTG ATTTTGACTT AGACAGCCGC ATTGAACTTG AACTGCGTTT AGATAAAAAA
GAACTGTCAG AACACTTAAT GTTGGTTGAT TTAGCTCGCA ACGATATCGC GCGCATCAGC
CAAAGCGGCA GCCGTAAAGT CGCTGAATTA TTGAAAGTTG ATCGCTATTC TCACGTGATG
CACCTTGTCA GTCGCGTAAC GGGTCAATTG CGCCAAGATT TAGATGCGCT GCATGCTTAT
CAGGCCTGTA TGAATATGGG TACTTTAGTT GGCGCCCCCA AAGTAAGCGC ATCACAACTG
GTTCGCCAAG CAGAAAAAGC CCGCCGTGGC AGCTACGGCG GCGCTGTGGG TTACCTTAAT
GCCCTTGGGG ATATGGACAC TTGCATAGTG ATCCGCTCCG CCTTTGTTAA AAATGGTACC
GCCTTTATTC AAGCGGGCGC GGGCGTGGTA TTTGATTCGG ATCCCCAAAG CGAGGCAGAT
GAAACCCGTC AAAAAGCCCA AGCCGTGATT TCAGCCATCA AGATGGGCGC TGGACTGCTA
GCAAATGAAT CGTCAGCTCA ATCCACTTCA GCACAATCCT CTTCAGTGCA ATATAAATAG
 
Protein sequence
MPKQTFARSS TLKAALTYHS DPLRLYQHIT QDAPHTMLLE SAEIDSKDHL KSMVMTHAAI 
MIRCDGYQLT FTALTDNGAS LLTPIETFFS ASGDRADMSA NLVRDNLTLV VTLQKDTKLQ
DEDARLKSTS PLDGLRMFIQ QIDCGTHTDS QSKPAFEDLF LGGVLAYDLI DTVEPLPAVP
NRDNDCPDYL FYLAETLILI DHKLKQADII THNFSRDSAQ HTAITAALSE RVQHLSTQCK
TLGNSPADVP TLVAIDATEQ VNISDEVFKQ TVIDLKEHII AGDIFQVVPS RSFSLPCPNT
LGAYRALRLT NPSPYMFYFR GQDFTLFGAS PESALKYEAS SNQVEVYPIA GTRKRGKTAT
GEIDFDLDSR IELELRLDKK ELSEHLMLVD LARNDIARIS QSGSRKVAEL LKVDRYSHVM
HLVSRVTGQL RQDLDALHAY QACMNMGTLV GAPKVSASQL VRQAEKARRG SYGGAVGYLN
ALGDMDTCIV IRSAFVKNGT AFIQAGAGVV FDSDPQSEAD ETRQKAQAVI SAIKMGAGLL
ANESSAQSTS AQSSSVQYK