Gene Shew_1903 details

Gene Information

Locus tag: Shew_1903
Symbol:
ID: 4921625
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella loihica PV-4
Kingdom: Bacteria
Replicon accession: NC_009092
Strand:
Start bp: 2197686
End bp: 2199119
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 56%
IMG OID: 640163471
Product: para-aminobenzoate synthase, subunit I
Protein accession: YP_001094028
Protein GI: 127512831
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
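
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The Python sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2197686, 2199119)
print(length)                  # 1434
print(protein_length(length))  # 477
```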


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.757454
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000298864
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATTTTT CGTCGACTCA AGGCGTTGCA CTGCAAAAGT TAGACTGGCA ACTCAGCACC 
ATTGAGGTAT TTGACTGCTT CGCTCACCTG CCCTGGGCCA TACTGCTGGA CTCCGCCGGC
GCCGATCATA TAGACGCCCG GTACGACATC ATCAGCTTCG ATCCTCTGGC CACCATCACC
AGCCAGGATG GTGTCACCCA TACCAGACAT CTGCGTCCCA CTGCGGCTAG TGCTACAGAA
GAAAATCAAT CTAAAGACGG TGTTGAAATT AGCCAGGACG ATCCCTTATC GATATTGCAA
GCGGCGATAG CCGACTACTT TCCGACTCAA CATGCCTGTG AGCTGCCCTT CAGCGGCGGC
GCCCTGGGCA CCTTTAGCTA TGATCTGGGC AGACGCATCG AGAAGCTACC CACCATTGCA
GCCCAAGATA TCGAACTTCC CGAGATGAAT ATCGGCCTCT ATGACTGGGC GCTGCTGTTT
TGCTACCAGA GCCAGACCTG GTCTCTGGTG CACTACCGAG GTGAAACGGC GCTTAAAGAA
CGTCTAGCCG ATTTAGAGGC AAGGTTAAGC TCGCCGTCGA ACGAAAAAGC CAATGGTGAG
AAGTTTGCCC TGAGCCGTGA CTGGCAGCCA CAGATCACTA AAGGCGAATA CAGAGACAAG
TTTGATCGGG TGCAGGATTA TCTACACAGC GGCGATTGCT ATCAGATCAA CCTTACCCAG
CGTTTCGAGG CCGAGTATCG CGGTGACGAG TGGCAGGCCT ACCTTAAACT CAGGGCCAGC
AACAAGGCGC CCTTCTCTGC CTTTATCCGC CTGGATATGC ACGCCATCCT CTCCATCTCG
CCCGAGCGTT TCATCAAGCT TAAGGGCGAT GCCATAGAGA CCAAGCCGAT CAAGGGCACC
ATGGCGCGAT CCAGCGACGC CACGGCCGAT AAGGCCGCCG CCGAAGCCCT GGCGGCATCG
GAGAAAGATC GCGCCGAAAA CCTGATGATC GTGGATCTGC TTCGAAACGA TATCGGCAAG
GTGGCAAGCC CTGGCAGCGT GCGGGTGCCC CATCTGTTTG CCATCGAAAG CTTCCCGGCG
GTGCACCATC TGGTGAGCAC TGTCACCGCC AATCTCGCTG CGCCCAACAG CCCCTGCGAT
CTATTAAGGG CAGCATTTCC CGGTGGCTCT ATCACGGGCG CACCTAAGAT CCGCGCCATG
GAGATCATCG AAGAGCTGGA GCCATCGCGC CGCAGCCTCT ACTGTGGCTC CATCGGCTAT
ATCAGCCAGG ACAGGCAGAT GGATACCAGC ATCACCATAC GTACCTTGGT GGCCGAGCCA
CCTCGTCTCT ACTGCTGGGC GGGCGGTGGC ATTGTGGCGG ACTCTCAGGT CGATGCCGAA
TATCAGGAGA GTTATGATAA GGTGAGTAAG ATCCTCCCCG TGCTGAGTAA CTGA
 
Protein sequence
MDFSSTQGVA LQKLDWQLST IEVFDCFAHL PWAILLDSAG ADHIDARYDI ISFDPLATIT 
SQDGVTHTRH LRPTAASATE ENQSKDGVEI SQDDPLSILQ AAIADYFPTQ HACELPFSGG
ALGTFSYDLG RRIEKLPTIA AQDIELPEMN IGLYDWALLF CYQSQTWSLV HYRGETALKE
RLADLEARLS SPSNEKANGE KFALSRDWQP QITKGEYRDK FDRVQDYLHS GDCYQINLTQ
RFEAEYRGDE WQAYLKLRAS NKAPFSAFIR LDMHAILSIS PERFIKLKGD AIETKPIKGT
MARSSDATAD KAAAEALAAS EKDRAENLMI VDLLRNDIGK VASPGSVRVP HLFAIESFPA
VHHLVSTVTA NLAAPNSPCD LLRAAFPGGS ITGAPKIRAM EIIEELEPSR RSLYCGSIGY
ISQDRQMDTS ITIRTLVAEP PRLYCWAGGG IVADSQVDAE YQESYDKVSK ILPVLSN
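
The gene and protein sequences can be cross-checked by translating the gene sequence with translation table 11, as listed under Gene Information. This is a minimal Python sketch, not the method used to produce the record. The helper names are hypothetical, the codon table is written out in the NCBI TCAG ordering (table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons, which does not matter here because the gene starts with ATG), and only the first 60 bases are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = "ATGGATTTTT CGTCGACTCA AGGCGTTGCA CTGCAAAAGT TAGACTGGCA ACTCAGCACC"
print(translate(first_line))  # MDFSSTQGVALQKLDWQLST
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein sequence and the GC content field.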