Gene Tery_4044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4044 
Symbol 
ID4242072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6249775 
End bp6250944 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content39% 
IMG OID638108950 
Productaminotransferase, class V 
Protein accessionYP_723531 
Protein GI113477470 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.159236 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAAC GTCCTATATA TCTCGACTGT AATGCAACTA CTCCCCTTGA TGAACGAGTA 
TTAAAAACAA TGTTGCCCTA CTTCACAGAA CATTTTGGCA ACCCCGCTAG CATTACCCAT
CAATATGGTT GGGAAGCAGA AGCAGCAGTG AAAAAAGCTA GAGAAATTTT GGCTACAGGT
ATTAATGCTA GTCCTGAAGA AATTATCTTT ACCAGTGGTG CAACAGAATC AAATAATTTA
GCCATCAAAG GAATAGCTGA AGCTTACTTT AATAAAGGCA AACATATTAT TACGATCACT
ACTGAACATA ATGCAGTTCT CGACCCCTGT GCCTATTTAC AAAATTTGGG ATTTGAAGTA
ACTTATTTAC CTGTAAATAG AGATGGAATT ATCGATATAA CTCGTCTTGA AACAGCTTTG
CGTGATGATA CAATTCTCGT ATCTATTATG GCAGCAAATA ATGAAATTGG AGTCTTACAA
CCCTTAGCAA AAATAGGAGA AATATGCAAA GAAAATTCGA TTATTTTCCA TACTGATGCT
GCACAAGCTA TTGGTAAAAT TTCTCTTGAC GTACAGGCAA TGAATATTGA TTTAATGTCA
TTAACTGCCC ATAAAATTTA CGGACCAAAA GGTATTGGTG CTATCTATGT GCGTCGTCGC
CATCCGAGAG TCAAAATAGC GCCTCAAATA CATGGAGGTG GACACGAACG AGGAATACGT
TCTGGTACTT TGTGTACGCC TCAAATAGTT GGTTTTAGTA AAGCGGTGGC ATTGGCGTTA
GCAGAAATAA AGTCGGAGGC AAAACGGTTA ACTAGTTTAC GACAACAGTT ATGGGAGAAG
TTACAAACAT TAGAAAATAT TTTTCTCAAC GGACATCCGA CTCAGCGTTT ACCAGGAAAT
TTAAACATTA GTGTTGAGGG TGTAGATGGC CAAGCTTTAT TGTTGGGCTT ACAAAGTGTG
ATGGCGGTTT CTTCTGGTTC TGCTTGCACT TCTGCCAAAA TCTCACCTTC CCATGTTTTG
CAAGCTTTAG GGCGTTCAGA AAAGTTAGCT TATGCTTCTG TGCGCTTTGG TATTGGGCGG
TTTAATACTG CCGAAGAAAT AGATCTAGTA GCAGAACAGG CGATCGCCAC AATTAAATCT
TTACGTCAAG CAACTACGAG TATTAAATAA
 
Protein sequence
MFKRPIYLDC NATTPLDERV LKTMLPYFTE HFGNPASITH QYGWEAEAAV KKAREILATG 
INASPEEIIF TSGATESNNL AIKGIAEAYF NKGKHIITIT TEHNAVLDPC AYLQNLGFEV
TYLPVNRDGI IDITRLETAL RDDTILVSIM AANNEIGVLQ PLAKIGEICK ENSIIFHTDA
AQAIGKISLD VQAMNIDLMS LTAHKIYGPK GIGAIYVRRR HPRVKIAPQI HGGGHERGIR
SGTLCTPQIV GFSKAVALAL AEIKSEAKRL TSLRQQLWEK LQTLENIFLN GHPTQRLPGN
LNISVEGVDG QALLLGLQSV MAVSSGSACT SAKISPSHVL QALGRSEKLA YASVRFGIGR
FNTAEEIDLV AEQAIATIKS LRQATTSIK