Gene Tery_3133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3133 
Symbol 
ID4244263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4789405 
End bp4790532 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content36% 
IMG OID638108143 
ProductNADH:flavin oxidoreductase/NADH oxidase 
Protein accessionYP_722736 
Protein GI113476675 
COG category[C] Energy production and conversion 
COG ID[COG1902] NADH:flavin oxidoreductases, Old Yellow Enzyme family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.656366 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.303319 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATCAAC TAAAAATACT TGAACCATTT ACACTAGGAG ATTTACAATT ACCTAATCGA 
ATTGTAATGG CACCTTTAAC CAGAAGACGT GCTGATATTA ATAATGCTCC TACCCCATTA
AATGCCTTAT ATTATAGTCA AAGATCTTCT GCTGGTTTAA TTATTAGTGA AGCTAGTCAA
ATTTCTCCCC AAGGTACAAG TTTACCAAAA ACTCCCGGAA TTTATAGCCA AAAGCAAATT
GAAGGGTGGC AACTGGTCAC AAAAGCCGTA CACAATTCTG GTGGTAGAAT TTTTATACAA
TTATGGCATG GTGGACGATG TTCCCATCCT TCTTTACAAC CTAATGGAGA ATTACCTGTT
GCACCTAGCG CGAGGGCCCC GATAGAGGAA AAAGCCTTAA CAGCACAGGA AAAAGAAGTC
CCTTTTGTTA ATCCAAGAAG TCTTTTAACC ACAGAAATAC CTGAAATTAT TGCTCAATAT
CGTCAAGGAG CGATAAATGC TTTAGAAGCA GGTGCTGATG GTGTAGAAAT TCATGGTGCA
AATGGTTATT TACTAGATCA GTTTTTACAA GATAATAGTA ATCAACGCAC TGATAAATAT
GGTGGAAGTA TCGAAAACCG TAGTCGTTTA CTCTTAGAAG TAACTCAAGC AGTAACAGAA
GTTTGGGGCT CGCAGCGTGT AGGAATACGT CTTTCTCCTA GTAGTACCTA TCAAGATATG
TATGATTCTA ACCCAGAGGC TTTATTTAAT TATATAGTAA GCAAAATCGA TCAGTTTAAT
TTAGCTTATC TTCATATTGT TGAGCCTCGA ATAAAAGGCA GTCACGATGA TTTAACTGAA
AAGAAATTAC AACTTGGAGT TAAACATTTC CGCCCTTTAT ATAGTGGAAA TTTAATGACA
GCTGGAGGTT ATACCCGCGA TCTAGGAGAA GAGATAATTA GTCAAGGTTA TACTGATTTA
GTAGCTTATG GAAGGCTATT TATTGCTAAT CCCGATCTAC CTAAACGTTT TGCCTTAAAT
GCACCATTAA ATCCTTATTA TCGTCCTACT TTTACGGGAG GAAATGAGAT AGGATATACT
GATTATCCTT TTCTATCAAT TGATACACTT GCCAAGAATA TAATTTGA
 
Protein sequence
MNQLKILEPF TLGDLQLPNR IVMAPLTRRR ADINNAPTPL NALYYSQRSS AGLIISEASQ 
ISPQGTSLPK TPGIYSQKQI EGWQLVTKAV HNSGGRIFIQ LWHGGRCSHP SLQPNGELPV
APSARAPIEE KALTAQEKEV PFVNPRSLLT TEIPEIIAQY RQGAINALEA GADGVEIHGA
NGYLLDQFLQ DNSNQRTDKY GGSIENRSRL LLEVTQAVTE VWGSQRVGIR LSPSSTYQDM
YDSNPEALFN YIVSKIDQFN LAYLHIVEPR IKGSHDDLTE KKLQLGVKHF RPLYSGNLMT
AGGYTRDLGE EIISQGYTDL VAYGRLFIAN PDLPKRFALN APLNPYYRPT FTGGNEIGYT
DYPFLSIDTL AKNII