Gene Tery_4203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4203 
Symbol 
ID4245855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6481410 
End bp6482798 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content44% 
IMG OID638109100 
Productamidase 
Protein accessionYP_723678 
Protein GI113477617 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0154] Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases 
TIGRFAM ID[TIGR02715] amidohydrolase, AtzE family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.742932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATCA AGCTCAACCA AACAGAAGCC GTATCAATGG CCACAGCCAT CAAAGCAGGG 
GAAACTACTG CTGAAATTTT AATAAACAAA TGTCTTGAAC AAATTTATGA GAATAACCGA
ACTTTAAATT GCTTCACCGC TATCACAACA GAAAGCGCAC TTAATGCTGC CAAACAAATA
GATAGAGACA TTTCTCAAGG CAAAAACCCC GGTCTCTTAG CAGGAATACC CTTCGCCGTC
AAAAACCTCT ATGACATTGC AGGTTTAACC ACCCTTGCTG GAGCCAAAAT TAATGCCGAA
AACCCACCTG CTACCCAAGA CGCAACCGCC GTTACCAAAC TAAAAAAAGC AGGTGCAATT
CTGGTTGGTG CCCTCAATAT GGATGAATAT GCCTACGGTT TTGTTACAGA AAATAGTCAC
TACGGTGCAA CACCTAATCC TCATGATCTC AGCCGTATCT CCGGGGGTTC CTCTGGTGCC
TCTGCTGCTG CGGTTGCTGC GGGTTTAGTA CCCATTACCC TTGGTTCCGA TACGAACGGT
TCCATCCGCG TACCTGCTTC TCTCTGTGGT GTTTTTGGGT TTAAGCCGAC TTATGGACGT
TTATCACGAG CTGGCGTTTT TTTGTTTGCC AGTAGTTTAG ATAATGTTGG ACCCTTTGCT
CGCTCTGTAC GAGATATTGC CACAGTTTAT GATATTTTAC AAGGGTCGGA TACAAGAGAT
CCAGTTTGTA CTAAACGTTC TCCTGAAAGT TGTTTACCTC AACTCAAACA AGATATTAAA
GATTTGCGCA TTGCTATTGC TGATGGTCAC TTTGCCCAAG GTGGTGAACC GGAGGTGTTT
ACAGCAGTGG AACAAGTGGC AGAGGTATTG GGTGTCACTC AGCGGGTGAC AATACCTGAA
GCAGATCGGG CGCGAGCTGC TGCTTATATT ATTACTGCGG CTGAAGGCGC AAATTTGCAT
TTGGATAATT TGCGCATCCG TCCCCAAGAT TTCGATCCAG CAACTCGCGA TCGCTTTTTA
GCAGGTGCTT TAATTCCGGC AGACTGGTAT ATCCAAGCTC AACGTTTCCG CCGTTGGTAT
CAAAGTTCTG TTAAGGAAAT ATTTAATGAT GTAGATATTA TTCTAGCTCC AACTACCCCT
TGTATTGCAC CGTTGTTAGG AGCTGAAAAA ATGACTATTA ATGGGGAGGA GGTGTTAGTA
CGTCCGAATT TAGGTTTGTA TACGCAACCT TTGTCTTTTA TTGGGTTGCC AGTTTTGTCA
GTTCCTATTC GACGTATTAA TGGTTTACCT TTGGGAGTAC AAATTATTGC TGCACCTTAT
AATGAGGCTT TGGTATTGCA AGTAGCAGCA GTGTTGGAAT TTGAAGGGCT AACTACTGAA
GTCAAATAG
 
Protein sequence
MTIKLNQTEA VSMATAIKAG ETTAEILINK CLEQIYENNR TLNCFTAITT ESALNAAKQI 
DRDISQGKNP GLLAGIPFAV KNLYDIAGLT TLAGAKINAE NPPATQDATA VTKLKKAGAI
LVGALNMDEY AYGFVTENSH YGATPNPHDL SRISGGSSGA SAAAVAAGLV PITLGSDTNG
SIRVPASLCG VFGFKPTYGR LSRAGVFLFA SSLDNVGPFA RSVRDIATVY DILQGSDTRD
PVCTKRSPES CLPQLKQDIK DLRIAIADGH FAQGGEPEVF TAVEQVAEVL GVTQRVTIPE
ADRARAAAYI ITAAEGANLH LDNLRIRPQD FDPATRDRFL AGALIPADWY IQAQRFRRWY
QSSVKEIFND VDIILAPTTP CIAPLLGAEK MTINGEEVLV RPNLGLYTQP LSFIGLPVLS
VPIRRINGLP LGVQIIAAPY NEALVLQVAA VLEFEGLTTE VK