Gene Shewana3_1520 details


Gene Information

Locus tag: Shewana3_1520
Symbol:
ID: 4477416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 1770319
End bp: 1772010
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 52%
IMG OID: 639726090
Product: anthranilate synthase component I
Protein accession: YP_869160
Protein GI: 117919968
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00565] anthranilate synthase component I, proteobacterial subset
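
The coordinates and lengths in this record agree with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon that contributes no amino acid. The variable names are not part of the record.

```python
# Values copied from the gene record.
start_bp = 1770319
end_bp = 1772010

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp. The last codon is assumed to be
# the stop codon, which is not translated into an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1692 0 563
```

Under these assumptions the result matches the listed gene length of 1692 bp and protein length of 563 aa.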


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0360897
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTAA AGACATTTAA TCAGGTTACC CAAGTTGATG GCGCGCATTT TGCCCCGTCT 
CAACAGACAT TCGCACGTTC GCATACGCTC AAGGCGACGC TGGCATATCA TAGCGACCCA
CTGCGCCTGT ACCAGCACAT CACCCAAGAT GTACCCCATA CCATGTTGTT AGAATCGGCA
GAAATCGACA GCAAGGAAAA TCTTAAGAGC ATGGTGATGA CCCATGCGGC GCTGATGATC
CGCTGCGATG GCTATCGTTT ACGCTTTAGC GCACTGAGTG ACAATGGCGT CAGTTTACTC
GCCCCCATCG AGCAATTTTT TATTGCTCGT TCAAGCCAAA CTCAATGCCA ACGCGATGGC
CACAACTTAG TGGTCACGCT GCAAAAGGAC ACTGAGCTTA AGGATGAAGA TGCCCGCTTA
AAATCCACCT CGCCCCTCGA TGGTTTGCGC TTGTTTGTAA AGCATATCGA CTGCGGCCAG
ACGCCAGCAT TCGAAGACTT ATTTTTAGGT GGCGTGCTCT CCTACGATTT GATTGATACC
GTCGAGCCAC TGCCTGAAGC ACCAAATGGT GCAAATGATT GTCCTGATTA TTTATTTTAT
CTCGCCGAAA CCTTAATTCT TATCGATCAC AAGCAAAAAC ACGCCGAGAT CATCACTCAC
AACTTCAGTG AAGGCACTGA ACAACATTTA GAGGTGACCC AAGCCTTAGC CGAGCGAGCA
GAGAACATCA GCGCCCAATG TGAAGCCCTA GCCAAGAGTG CAACACCTGC ACCCGCCCTC
GTCGGCATAA CGGCCACAGA GCAAGTGAAT GTTAGTGATG ATGACTTCAA GCAAACCGTT
ATTGATTTAA AAGAACACAT TATTGCCGGC GACATCTTCC AAGTCGTGCC TTCACGCAGC
TTTAGCCTGC CCTGCCCCGA TACCTTGGGA GCTTACCGCG CACTACGTTT AACCAACCCC
AGCCCCTATA TGTTTTATTT CAGGGGCTAT GACTTCACCC TGTTTGGCGC CTCGCCCGAG
AGCGCGCTGA AATATGAAGC CAGCAATAAT CAGGTCGAAG TCTACCCGAT TGCAGGCACC
CGTAAACGCG GCAAAACCGC CAGCGGCGAG ATCGATTTCG ACCTCGATAG CCGTATCGAA
CTCGAATTGC GTTTGGATAA AAAAGAGCTC TCCGAACATT TAATGTTGGT CGATTTAGCC
CGTAACGATA TCGCCCGTAT CAGCCAAAGC GGCAGCCGCA AAGTGGCCGA GTTACTTAAA
GTCGACCGTT ATTCCCACGT CATGCACTTA GTCAGCCGCG TCACAGGTCA ACTACGCCAA
GACTTAGATG CACTCCACGC CTATCAGGCC TGTATGAATA TGGGCACGCT AGTTGGCGCC
CCGAAAGTGC GCGCCTCTCA ATTGGTGCGT CAAGCAGAAA AGACCCGCCG CGGCAGCTAC
GGCGGCGCTG TGGGTTACCT CAACGCCCTT GGGGATATGG ACACCTGTAT CGTTATCCGC
TCCGCCTTTG TGAAAAACGG CGTTGCCCAT ATCCAAGCGG GTGCTGGCGT GGTGTTTGAT
TCCGATCCGC AGAGTGAAGC CGATGAAACC CGCCAAAAGG CGCAGGCTGT GATTTCGGCC
ATCAAGATGG GCGCAGGCTT GGATGAAAGC CAGCAAGCAA CTGCGACCAC TACGACTGAA
CAACAAAGAT AA
 
Protein sequence
MTLKTFNQVT QVDGAHFAPS QQTFARSHTL KATLAYHSDP LRLYQHITQD VPHTMLLESA 
EIDSKENLKS MVMTHAALMI RCDGYRLRFS ALSDNGVSLL APIEQFFIAR SSQTQCQRDG
HNLVVTLQKD TELKDEDARL KSTSPLDGLR LFVKHIDCGQ TPAFEDLFLG GVLSYDLIDT
VEPLPEAPNG ANDCPDYLFY LAETLILIDH KQKHAEIITH NFSEGTEQHL EVTQALAERA
ENISAQCEAL AKSATPAPAL VGITATEQVN VSDDDFKQTV IDLKEHIIAG DIFQVVPSRS
FSLPCPDTLG AYRALRLTNP SPYMFYFRGY DFTLFGASPE SALKYEASNN QVEVYPIAGT
RKRGKTASGE IDFDLDSRIE LELRLDKKEL SEHLMLVDLA RNDIARISQS GSRKVAELLK
VDRYSHVMHL VSRVTGQLRQ DLDALHAYQA CMNMGTLVGA PKVRASQLVR QAEKTRRGSY
GGAVGYLNAL GDMDTCIVIR SAFVKNGVAH IQAGAGVVFD SDPQSEADET RQKAQAVISA
IKMGAGLDES QQATATTTTE QQR
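
The sequences above are printed in space-separated blocks of ten residues, so the whitespace has to be stripped before they can be used. The following is a minimal sketch of that step, with a GC-fraction calculation and a start/stop codon check added. The helper names (`clean_sequence`, `gc_fraction`, `has_start_and_stop`) are hypothetical and not part of any tool used by this page. The stop codons checked are TAA, TAG and TGA, as in translation table 11, and only the canonical ATG start is tested, although table 11 also allows alternative start codons. The excerpt in the example is just the first two blocks and the last line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(seq: str) -> bool:
    """Check for an ATG start codon and a TAA/TAG/TGA stop codon in frame."""
    return (
        len(seq) % 3 == 0
        and seq[:3] == "ATG"
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# Excerpt only: first two blocks of the gene sequence.
first_blocks = clean_sequence("ATGACCCTAA AGACATTTAA")
print(len(first_blocks), gc_fraction(first_blocks))

# Excerpt only: the last line of the gene sequence ends in a stop codon.
last_line = clean_sequence("CAACAAAGAT AA")
print(last_line[-3:])
```

Applied to the full gene sequence, `clean_sequence` should yield a string whose length can be compared with the listed gene length, and `gc_fraction` gives a value to compare with the listed GC content.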