Gene Tery_0740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0740 
Symbol 
ID4243188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1195939 
End bp1197330 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content35% 
IMG OID638106031 
Productargininosuccinate lyase 
Protein accessionYP_720644 
Protein GI113474583 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.121768 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.29697 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAAG ATAGACAAAC ATGGAGCCAA AGATTTGAAA AAGCACTGCA TCCTATGATA 
GCTAGATTTA ATGCTAGTAT TAGCTTTGAT ATTGAATTGA TTGAGCATGA TATAAATGGC
TCAGTAGCTC ATACAAAAAT GTTAGCTAAA ACAGGTATTA TCTCCGCTGA AGAAGGAGAA
AAATTATTGA CAGGTTTAGA ACAAATTCGT CAAGAATATA TAACCGGAAA TTTTCATATA
GTTGAAGATG CTGAAGACGT TCACTTTGCT GTCGAAAAAA GACTGATAGA AATCACTGGA
GATGTAGGCA AAAAACTACA TACAGCTAGG TCTAGAAATG ACCAAGTAGG AACTGATACT
AGGCTTTATT TACGAGAAAA AATTGAACAA ATTCGGTTTT TATTGATCAA ATTTCAAAAA
GTTATATTAG AATTAGCTGA GTCTAATATT GAAACTTTAA TCCCTGGTTA TACTCACTTA
CAAAGAGCGC AACCTTTGAG TTTAGCTCAT CATCTTTTGG CTTATTTTCA TAAGGCGGAA
AGAGATTGGG AAAGGCTAGG AGATGTTTAT CGTCGAGTTA ATATTTCTCC CCTTGGTTGT
GGTGCTTTAG CTGGTACAAC TTTTCCTATT GATAGACACT ATACTGCTGA GTTATTAGGG
TTTGAAAAAC CTTATGCTAA TAGTTTAGAT GGCGTAAGTG ACAGAGATTT TGCTATTGAA
TTTCTTTGCG CAGCTAGTAT AATTATGGTG CATTTAAGTC GTTTGGCTGA AGAAATAATT
GTGTGGTCAT CTGAAGAATT TAGGTTTGTA ACTTTAACTG ATGTTTGTTC CACAGGCTCG
AGTATTATGC CTCAAAAAAA GAATCCTGAT GTACCAGAGT TAGTTAGGGG AAAAACAGGT
CGAGTATTTG GTCATTTACA AAGTATGTTA GTGGTGATGA AGGGTCTGCC ACTAGCATAT
AATAAAGACT TGCAAGAAGA TAAAGAAGGT TTGTTTGATA GTATTAAAAC TGTTAAAAGT
TGTCTGGAGG CAATGACAAT TTTATTGGAG GAAGGTTTGG AATTTAACAG TGATCGCCTA
ACAGAAGCAG TAGCAGAAGA TTTTTCTAAT GCTACAGATG TGGCAGATTA TTTAGCAGCT
CGTGGTGTAC CATTTCGAGA AGCTTATAAT TTAGTAGGTA AGGTTGTGAA AACTTGTATT
TCTGGTGGTA AATTACTGAA AGATTTAACT ATAAAAGAAT GGAAGGAATT ACATCCTGTT
TTTGCAAGTG ACATTTATGA AGCGATAACC CCTTACCAAG TAGTGTCTGC ACGCAATAGC
TATGGTGGAA CTGGTTTTAA ACAAGTAAAA ACAGAAATTG ACGTTGCCAA GCAAAAATTG
GCAGAAAAAT AA
 
Protein sequence
MKQDRQTWSQ RFEKALHPMI ARFNASISFD IELIEHDING SVAHTKMLAK TGIISAEEGE 
KLLTGLEQIR QEYITGNFHI VEDAEDVHFA VEKRLIEITG DVGKKLHTAR SRNDQVGTDT
RLYLREKIEQ IRFLLIKFQK VILELAESNI ETLIPGYTHL QRAQPLSLAH HLLAYFHKAE
RDWERLGDVY RRVNISPLGC GALAGTTFPI DRHYTAELLG FEKPYANSLD GVSDRDFAIE
FLCAASIIMV HLSRLAEEII VWSSEEFRFV TLTDVCSTGS SIMPQKKNPD VPELVRGKTG
RVFGHLQSML VVMKGLPLAY NKDLQEDKEG LFDSIKTVKS CLEAMTILLE EGLEFNSDRL
TEAVAEDFSN ATDVADYLAA RGVPFREAYN LVGKVVKTCI SGGKLLKDLT IKEWKELHPV
FASDIYEAIT PYQVVSARNS YGGTGFKQVK TEIDVAKQKL AEK