Gene Tery_1984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1984 
SymbolnadE 
ID4243411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3093311 
End bp3095065 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content33% 
IMG OID638107101 
ProductNAD synthetase 
Protein accessionYP_721708 
Protein GI113475647 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG0171] NAD synthase
[COG0388] Predicted amidohydrolase 
TIGRFAM ID[TIGR00552] NAD+ synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.187378 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTG CAATTGCTCA ACTTAATCCT GTGATTGGTG ATATTTCAGG CAATGCCAAA 
TTAATTTTGG ATGCTGCACA AAAAGCAAAA AAATTAGATG CTAAGTTGAT GATAACTCCA
GAATTATCAT TAATAGGTTA TCCCCCACGA GATTTATTAA TTTATCCTAG TTTAATTGAA
GCTGCAGTTC TAGAATTAGA AAATTTAGCC AAATATTTAC CATCAGAAAT AGCCGTTTTA
GTAGGAACTG TAACCTTTAA TTATCAAGCC GCTAATACAG GAGAAAAATC ACTATTTAAT
AGTGCAGTTT TATTAACCAA TGGCGAAATC AAACAAGTAT TTCATAAACA ACTTTTACCT
ACTTATGATG TATTTGATGA AGACCGATAT TTTGAACCTG GTAAAACTAG GGATTTTTTC
ACGTTAGAAA ATTATTCTAA CAGCTCAGAA AATTTAGCTA AAGTAGGTGT AACTATCTGC
GAAGACTTAT GGAATGATGA GGCTTTTTGG GGAAAACGAA ATTATGCTTA TGACCCGATG
AAAGAATTAG CAGCACAAAA AGTTGATTTT GTGATTAATA TGTCAGCTTC TCCCTATCAA
ACTGGAAAAC AAAAATTGCG AGAAGCAATG TTAAAACATA GTACAAATTG TTATCAAATA
CCAATTATTT ATGTCAATCA AGTGGGTGGT AATGATGATT TAATTTTTGA TGGTTGTAGT
GTAGTTTTTA ATGGTGCTGG AAATGTGGTT TATCGTGCTC AAGCTTTTGA GACAAGTTTG
GCAGTTGTAG AATTTAATTC AGCAAAAAAA GATTTCATTT CAGTAGATTT TAAAAGTATA
AATTTGCCAG AAAGTGAGGA TGAAGAAATT TGGTCTGCTT TGGTTTTAGG CCTAAGAGAT
TATGTACAAA AGTGTGGTTT TTCTAAAGTT GTTCTGGGTT TAAGTGGAGG AATAGACTCA
GCTTTAGTTG CAGCGATCGC TACTGCTGCA TTAGGAAAAG AAAATGTCTT TGCTATTTTG
ATGCCTTCTC CCTACAGTTC TGAGCATTCG GTAAAAGATG CTTTAGAATT AGCAGAAAAT
TTGGGTATTG CTAAACAAAT TATATCTATT GAGAATTTAA TGAAGGATTA TGATAATAGT
CTGTCAAGTT TATTTACAGG TACAAATTTT GGTATTGCTG AAGAGAATAT TCAATCTCGG
ATTCGTGGAA ATTTATTAAT GGCTATTTCT AATAAGTTTG GTTATTTACT TTTATCTACA
GGCAATAAGT CAGAAATGGC TGTTGGTTAT TGTACTCTTT ATGGTGATAT GAATGGTGGA
TTAGCAGTAA TTTCAGATGT GCCGAAAACT CGGGTTTATT CTTTATGTCA GTGGTTGAAT
GAACAGACAG TTAATAACAA TAAAAAATTC TCTGGATCCC AAAACTTACT AATGACTGAA
AAGCAAAATA TTATTCCCAA AAATATACTG ACAAAAGCTC CCAGTGCTGA GTTAAAAGAA
GGTCAAAAGG ATGAGGATTC TTTACCTGCT TATGAAGTCT TAGATGATAT TTTATTTAGG
TTAGTAGAAA AGTGCGAATC TTTAGACAAA ATTATTGCTG CAGGACATGA TTTAGAGGTG
GTAAATAAGG TAGTAAAATT AGTCATGAGG GCAGAATTTA AACGTAGACA AGCACCCCCA
GGTTTGAAAA TTAGTACTCG CGCTTTTGGC ACAGGTTGGC GGATGCCTAT TGCTAAAAAA
TTAGTTATTA ACTGA
 
Protein sequence
MKIAIAQLNP VIGDISGNAK LILDAAQKAK KLDAKLMITP ELSLIGYPPR DLLIYPSLIE 
AAVLELENLA KYLPSEIAVL VGTVTFNYQA ANTGEKSLFN SAVLLTNGEI KQVFHKQLLP
TYDVFDEDRY FEPGKTRDFF TLENYSNSSE NLAKVGVTIC EDLWNDEAFW GKRNYAYDPM
KELAAQKVDF VINMSASPYQ TGKQKLREAM LKHSTNCYQI PIIYVNQVGG NDDLIFDGCS
VVFNGAGNVV YRAQAFETSL AVVEFNSAKK DFISVDFKSI NLPESEDEEI WSALVLGLRD
YVQKCGFSKV VLGLSGGIDS ALVAAIATAA LGKENVFAIL MPSPYSSEHS VKDALELAEN
LGIAKQIISI ENLMKDYDNS LSSLFTGTNF GIAEENIQSR IRGNLLMAIS NKFGYLLLST
GNKSEMAVGY CTLYGDMNGG LAVISDVPKT RVYSLCQWLN EQTVNNNKKF SGSQNLLMTE
KQNIIPKNIL TKAPSAELKE GQKDEDSLPA YEVLDDILFR LVEKCESLDK IIAAGHDLEV
VNKVVKLVMR AEFKRRQAPP GLKISTRAFG TGWRMPIAKK LVIN