Gene Paes_1566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1566 
Symbol 
ID6459385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1704669 
End bp1706165 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content51% 
IMG OID642725554 
Productanthranilate synthase component I 
Protein accessionYP_002016231 
Protein GI194334371 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0133412 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00994784 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGATAT CAAGACAGAG CGAAACCTTT ACGCTTCATC CTCTTATCAG CGCTGTCCAT 
GCTGATACCG AAACCCCTGT TTCAGTCTAT CTCAAGCTTC GTGAACCCTA TTCCTGCCTT
CTTGAGTCGG TTGAAGGAGA GGAGCAGCTT GCCCGTTTTT CTATTATTGC GATCGATCCT
GTCGCTGTTC TTAAAGGGAC GGTCAATGGG GATGTTTCTA TAGACATCCG CGATAAGCGG
TTTGAACGCC TGGCAGCGAT TCCCGGTGAG TCGTCCGGTT TGCGCGATGC TGTCGACCGT
TGTCTCGCTC TTTTTAAAAG CAGCGAATTT ACGCCTGACG GTCCTGGCGC ATCACGGATG
ATTACATCCG GAGCATTCGG TTATTTTGCC TACGACACCA TGCATCTGGT AGAACGGATT
CCTTCTGCCC AACTGCCCGA TCCTGCCGAT CTTCCTGACG TATGCCTCCT GTTATGCGAT
AAACTGGTTG TTTTCGACAA TGTCAAGCGC AAGGTGTTTA TCATCGTGAA TTATCTCGAT
GAGGCTGATC GTCCGAGGGC TGAAAAAACC ATGGCAGATA TCAGGGCAAG GATGTTTAAT
CCCTTATCAG CCCGGGAGCT GATGCTTGTA CCCGAAAAGC CTGAGCCGAT TGTCTCTAAT
ACCGAGCGGG AAGCCTATCT GGAGAAGATC CGGGTTGCCA AGGAGTACAT TATGCAGGGG
GATATTTTTC AGGTACAGGT TTCGCAGCGT CTCAAACGCC ATCTCAATTC ACGCCCGTTT
GACGTTTACC GTATGCTGCG AACCATCAAT CCTTCTCCAT ATCTCTATTA TTTCGATATG
GAGGAGTTCA TGATTGTGGG TTCATCTCCT GAACTTCTGG TCAAGGTCGC CGATGATCAC
GACGGCAGAA GGATCGTCGA CACCCGTCCG ATCGCAGGAA CCCGCCCGAG AGGTGCGACA
TGGGAAGAAG ACCAGCGTAT CGAGAAAGAG CTTTTACGTG ACGAAAAGGA GCTTGCCGAG
CACTTGATGC TGATCGACTT GAGCCGTAAC GATATCGGGC GGATAGCTAA AATCGGGACG
GTTGAAACCA ATGAGATGAT GATCATCGAG CGCTATTCGC ATGTGATGCA TATTGTCAGC
AACGTTCGGG GTGAGCTGCG TGACGAATTG ACGCCGATGG ACGCATTCTG GGCATGCTTT
CCTGCCGGAA CCCTGACAGG TGCCCCTAAA GTCAGGGCTA TGGAAATCAT CTATGAACTG
GAGCAGGAGA AACGCGGGCT TTATGGTGGA GCGGTCGGAT TTATCGATTT CTGCGGGCAG
CTTGAGACCG CTATTGCTAT TCGTACCATG GTGGTTCGTG ACGATACAGT CTATTTCCAG
GCGGCCGGTG GTGTGGTTGC CGATTCTCTT GCCGAGAATG AATTCGACGA GACCATGAAC
AAAATGAGAG CGGGGCTCAG AACGCTCGAA GCGCTCGAGC ATGTTTCAAA CGATTGA
 
Protein sequence
MSISRQSETF TLHPLISAVH ADTETPVSVY LKLREPYSCL LESVEGEEQL ARFSIIAIDP 
VAVLKGTVNG DVSIDIRDKR FERLAAIPGE SSGLRDAVDR CLALFKSSEF TPDGPGASRM
ITSGAFGYFA YDTMHLVERI PSAQLPDPAD LPDVCLLLCD KLVVFDNVKR KVFIIVNYLD
EADRPRAEKT MADIRARMFN PLSARELMLV PEKPEPIVSN TEREAYLEKI RVAKEYIMQG
DIFQVQVSQR LKRHLNSRPF DVYRMLRTIN PSPYLYYFDM EEFMIVGSSP ELLVKVADDH
DGRRIVDTRP IAGTRPRGAT WEEDQRIEKE LLRDEKELAE HLMLIDLSRN DIGRIAKIGT
VETNEMMIIE RYSHVMHIVS NVRGELRDEL TPMDAFWACF PAGTLTGAPK VRAMEIIYEL
EQEKRGLYGG AVGFIDFCGQ LETAIAIRTM VVRDDTVYFQ AAGGVVADSL AENEFDETMN
KMRAGLRTLE ALEHVSND