Gene Paes_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1047 
Symbol 
ID6459898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1151402 
End bp1152784 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content49% 
IMG OID642725047 
Productargininosuccinate lyase 
Protein accessionYP_002015733 
Protein GI194333873 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00526091 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000155334 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAGCAAGA AAAAAGAACT CTTGTGGCAA AGCCGCTTTT CAGAGCCGTT TGATCGTGAG 
GCTCTCCTCT TTTCCTCCTC CGTGGATGTC GATAAAGAAC TCTATCAGGA AGACATCACA
GGCTCTATTG CCCATGTCAC CATGCTTTCC GAAGAAGCGA TCATTCCGGC AGAAGAGGCC
AGACTGATTA TCGAAGGTCT GCAGGAGATC GAAGAGGAAA TCAGCACAGG AAGTCTTGTC
CCGCATTGGG AGGATGAGGA TATTCACACC GTGATAGAAA ATCGCCTGAA AGAAAAGATC
GGCCCTATCG CAGGCAAGAT CCATTCAGGC AGAAGCCGTA ACGACCAGGT AGCCACTGAT
ACCAGGCTTT ATCTCAAACG CTCTATCGAA GAGATCCGCC AGGCCCTGAA AGAGCTGAAA
ACTGTTCTGG TGGACAAAGC CGAAGCCTAC CGCAGAACAA TCATCTTCGG TTATACCCAT
CTTCAAAGAG CACAACCTAT TTCAGCAGGG CACTACTACC TCGCCTATTT CAACATGTTC
GACCGGGATA ATCAACGCCT TCAGGACCTT TATAAACGGG TCGATATCTC TCCTCTGGGT
GCTGCTGCAT TTGCAGGAAG CACGTTAGCG CTCAATGCGG AGAGAAGCCG TGATCTGCTC
GAATTTGAGG GGCTGTTTCA CAACAGCATT GACGCCGTCA GTGACCGCGA CATCATTATA
GAATTCGTCT CGGCATGCTC AATTATCATG ATGCACCTGT CGAGATTTGC TGAAGACCTT
ATCCTCTGGA GCTCCTACGA ATTCAACTAT CTTGAAATCA GTGACGCTTT TGCCACCGGT
TCATCGATCA TGCCGCAGAA AAAAAACGCC GATATCGCAG AACTGGTCAG AGGGAAAACA
GGACGGGTCT ATGGCGATCT CATGGCTATG CTGACCATCA TGAAAGGTCT GCCGCTCTCC
TACAACCGTG ACATGCAGGA AGACAAACCG CCGCTCTTCG ATGCATCAAA AACGACACGT
TCATCTGTAC GGATTTTCAC AAAAATGCTC GAAAACACGT CGATTAAAGA GAACCGCCTC
TCATCACTCG TAGCAAAAGA CCTGAGCCTT GCAACGGAAA TAGCCGAATA TCTGGTACAA
AAAAACATGC CGTTCCGAGA CGCTCACCGG GTCACAGGAA AAATAGTCAG CCATGTCATC
GAATCGGGAA CAACGCTTCC TGACATGACT CTCGAAACCT ACCGGACGTT TTCAGACCTC
TTTGACGAAG ACCTCTATGA TGCGCTGAAA CCCGAAGCGA GCGTCAATGC AAAAAAAACC
CACGGGAGCA CATCATTCGC ATCGGTTGAA GAACAGATCG TCTCAGCAAG AACACGGATC
TGA
 
Protein sequence
MSKKKELLWQ SRFSEPFDRE ALLFSSSVDV DKELYQEDIT GSIAHVTMLS EEAIIPAEEA 
RLIIEGLQEI EEEISTGSLV PHWEDEDIHT VIENRLKEKI GPIAGKIHSG RSRNDQVATD
TRLYLKRSIE EIRQALKELK TVLVDKAEAY RRTIIFGYTH LQRAQPISAG HYYLAYFNMF
DRDNQRLQDL YKRVDISPLG AAAFAGSTLA LNAERSRDLL EFEGLFHNSI DAVSDRDIII
EFVSACSIIM MHLSRFAEDL ILWSSYEFNY LEISDAFATG SSIMPQKKNA DIAELVRGKT
GRVYGDLMAM LTIMKGLPLS YNRDMQEDKP PLFDASKTTR SSVRIFTKML ENTSIKENRL
SSLVAKDLSL ATEIAEYLVQ KNMPFRDAHR VTGKIVSHVI ESGTTLPDMT LETYRTFSDL
FDEDLYDALK PEASVNAKKT HGSTSFASVE EQIVSARTRI