Gene Tery_4134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4134 
Symbol 
ID4245648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6377943 
End bp6379145 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content42% 
IMG OID638109035 
Productaminotransferase, class V 
Protein accessionYP_723615 
Protein GI113477554 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0486971 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACAAA CTATTTATCT AGACAACAAC GCTACCACTA AAGTTGATGA GGCAGTGTTA 
GAGGAAATGC TGCCTTACCT GAGTCAGTTT TATGGTAATC CCTCTAGTAT GCACACTTTT
GGCGGAAAAG TTGGTAAGGC TACGAGAAAA GCGCGATCGC AAGTAGCAGC TTTACTCAAC
GCCGAAGATA CAGAAATTAT TTTTACCAGT TGTGGTACTG AGGGTGATAA TGCTGCTATT
CGAGCTGCTC TTACTGCTCA ACCTAATAAA CGGCATATTA TTACTACTCA AGTAGAACAC
CCAGCAGTTT TGAGTCTCTG TAAGTATTTG GAGAAGCAGG GGTATACAGT TACTTATCTA
TCAGTAGATA GCCAGGGAAT GATAGATTTA ACTGAACTGG AAGCTGCTAT TACAGGTAAT
ACTGCTCTAG TTTCAGTTAT GTATGCCAAT AATGAAACAG GGGTAGTTTT CCCCATAGAG
AAAATTGGGC AGATAGCTAA AGAATATGGT GCGCTGTTCC ATGTTGATGG GGTACAAGCA
GTAGGAAAAG TGCCTTTAGA TATGAAAAAT AGCACTATAG ATATGTTAGC TTTGTCTGGT
CATAAGTTGC ATGCACCTAA GGGTATTGCT GCTTTGTATG TGCGCCGCGG TACTCGCTTC
CGTCCGTTAT TAATTGGGGG ACATCAGGAA CGAGGTCGCC GTGCGGGGAC TGAAAATGTA
CCTGGTATTA TTGCTCTGGG TAAAGCCTGT GAGTTGGCAA AGGAGCATTT GGCACATATT
GACAAGGAAC GGGAACTGCG CGATTTGTTA GAAAATGGTA TTTTGAATAC TATTCCTGAT
TGTGCTGTTA ATGGTCATCT TATTCAAAGG TTGCCTAATA CTACTAATAT TGGTTTCAAG
TATATTGAGG GAGAAGCAAT ATTGCTGCAT TTGAGCCATT ATGGTATCTG TGCTTCTTCA
GGTTCGGCTT GCACTTCTGG ATCTCTTGAG CCTTCACACG TTTTGCGGGC AATGGGTCTG
CCGTATACAG TTTTACATGG TTCAATTCGT TTTTCTCTAT CACGTTATAC AACTCGCCAA
GAAGTTGAGG AAGTACTGGC GGTGATGCCG AGCATAGCAG AACGTTTGCG TGCTTTATCT
CCCTTTAAGA ACGATCAAGC TGATTGGTTG CAAGAGCGAC AACAAGCAAC TGTATCCTCC
TAA
 
Protein sequence
MQQTIYLDNN ATTKVDEAVL EEMLPYLSQF YGNPSSMHTF GGKVGKATRK ARSQVAALLN 
AEDTEIIFTS CGTEGDNAAI RAALTAQPNK RHIITTQVEH PAVLSLCKYL EKQGYTVTYL
SVDSQGMIDL TELEAAITGN TALVSVMYAN NETGVVFPIE KIGQIAKEYG ALFHVDGVQA
VGKVPLDMKN STIDMLALSG HKLHAPKGIA ALYVRRGTRF RPLLIGGHQE RGRRAGTENV
PGIIALGKAC ELAKEHLAHI DKERELRDLL ENGILNTIPD CAVNGHLIQR LPNTTNIGFK
YIEGEAILLH LSHYGICASS GSACTSGSLE PSHVLRAMGL PYTVLHGSIR FSLSRYTTRQ
EVEEVLAVMP SIAERLRALS PFKNDQADWL QERQQATVSS