Gene Sbal223_1664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_1664 
Symbol 
ID7086270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp1944918 
End bp1946612 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content50% 
IMG OID643460565 
Productanthranilate synthase component I 
Protein accessionYP_002357592 
Protein GI217972841 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00565] anthranilate synthase component I, proteobacterial subset 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00684906 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000845967 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCAAAC AGACATTCGC ACGCTCAAGC ACACTCAAGG CGGCATTAAC CTACCACAGC 
GATCCACTGC GCTTGTATCA GCACATGACC CAAGATGCGC CCCATACTAT GTTGTTGGAG
TCGGCCGAAA TCGACAGTAA AGATCACTTA AAAAGCATGG TGATGACCCA TGCCGCCATG
ATGATCCGCT GCGACGGTTA TCAACTGACC TTTACCGCAC TGACCGACAA TGGCGCGAGC
TTACTTACCC CAATCGAAAC CTTCTTCAGC GAGAGCGGCG ATCGTGCTGA TATGAGTGCC
AATCTAGTCC GTGATAACTT GACCTTAGTG GTGACACTGC AAAAAGACAC TAAGCTGCAG
GATGAAGATG CACGCTTAAA ATCTACCTCG CCACTCGATG GCCTACGGAT GTTTATCCAG
CAAATTGATT GTGGCACTCA TACTGACAGC CAAAGTAAAC CCGCCTTTGA GGATCTGTTT
TTAGGTGGCG TGCTGGCCTA CGACTTGATT GATACCGTCG AACCACTGCC AGCCGTGCCG
AACCGCGATA ATGATTGTCC AGACTACTTA TTCTACCTCG CTGAAACCTT AATCCTTATT
GACCATAAAC TAAAACAAGC CGACATCATT ACCCATAATT TCAGCCGTGA TTCAGTCCAG
TATGCCGCCA TCACCGCAGC GCTGAGCGAG CGAGTACAGC TGTTAAGCAC CCAATGTAAA
ACTCTGGGTA ATTCACCTGC CGATGTGCCG ACACTGGTCG CCATCGACGC TACTGAGCAA
GTCAATATTT CCGATGAGGT GTTCAAACAA ACCGTTATCG ATTTGAAAGA ACACATTATT
GCGGGCGATA TTTTCCAAGT GGTGCCATCG CGTAGCTTTA GTTTACCTTG CCCGAATACC
TTAGGGGCTT ATCGCGCCCT TCGTCTAACT AACCCAAGCC CTTACATGTT TTATTTCAGG
GGCCAAGATT TCACGCTTTT TGGTGCTTCA CCAGAAAGCG CGCTTAAATA CGAGGCCAGC
AGCAATCAAG TCGAAGTCTA CCCGATTGCT GGCACCCGCA AACGCGGCAA AACCGCCACG
GGCGAGATTG ATTTTGACTT AGACAGCCGT ATTGAACTTG AACTGCGTTT AGATAAAAAA
GAACTGTCAG AACACTTAAT GTTGGTCGAT TTAGCTCGCA ACGATATCGC GCGTATCAGC
CAAAGCGGCA GCCGTAAAGT CGCCGAATTA TTGAAAGTGG ACCGTTATTC CCACGTGATG
CACCTCGTCA GTCGCGTAAC GGGTCAACTG CGCCAAGATT TAGATGCGCT GCATGCTTAT
CAGGCGTGTA TGAATATGGG CACTTTAGTT GGCGCGCCCA AAGTAAGCGC ATCACAACTG
GTTCGCCAAG CGGAAAAAGC CCGCCGCGGC AGCTACGGCG GCGCTGTGGG TTACCTTAAT
GCTCTTGGTG ATATGGACAC CTGTATTGTG ATCCGCTCGG CCTTTGTTAA AAATGGCACC
GCCTTTATTC AAGCGGGCGC GGGCGTCGTG TTTGATTCGG ATCCCCAAAG TGAGGCTGAC
GAAACCCGTC AAAAAGCCCA AGCCGTGATT TCGGCCATCA AAATGGGCGC TGGACTGCGA
GTCAATGAAT CGCCAGCAAA TGACGCGTCG GCTCAATCCA CTTTTGTGCA ATCCACTTCA
GTACAATCTA AATAG
 
Protein sequence
MPKQTFARSS TLKAALTYHS DPLRLYQHMT QDAPHTMLLE SAEIDSKDHL KSMVMTHAAM 
MIRCDGYQLT FTALTDNGAS LLTPIETFFS ESGDRADMSA NLVRDNLTLV VTLQKDTKLQ
DEDARLKSTS PLDGLRMFIQ QIDCGTHTDS QSKPAFEDLF LGGVLAYDLI DTVEPLPAVP
NRDNDCPDYL FYLAETLILI DHKLKQADII THNFSRDSVQ YAAITAALSE RVQLLSTQCK
TLGNSPADVP TLVAIDATEQ VNISDEVFKQ TVIDLKEHII AGDIFQVVPS RSFSLPCPNT
LGAYRALRLT NPSPYMFYFR GQDFTLFGAS PESALKYEAS SNQVEVYPIA GTRKRGKTAT
GEIDFDLDSR IELELRLDKK ELSEHLMLVD LARNDIARIS QSGSRKVAEL LKVDRYSHVM
HLVSRVTGQL RQDLDALHAY QACMNMGTLV GAPKVSASQL VRQAEKARRG SYGGAVGYLN
ALGDMDTCIV IRSAFVKNGT AFIQAGAGVV FDSDPQSEAD ETRQKAQAVI SAIKMGAGLR
VNESPANDAS AQSTFVQSTS VQSK