Gene Tery_3924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3924 
Symbol 
ID4244007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6064632 
End bp6065948 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content40% 
IMG OID638108847 
Producthypothetical protein 
Protein accessionYP_723429 
Protein GI113477368 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.647872 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00979003 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGTAA ATGTCAAAGC TTTCTTTCAG GCAACAAACC CTGGTAAAGC CCTATTCAAA 
GATAACCAGG AAATAGAGGA AAAATACTAT ATTGACTTCT CCTCAGTACG GGGGGGAAAA
ATAATAGAAG ACCTCAAAGA CAATATTGCT ATGTGGTTTC CCGACGAACC TACCTGTCAA
CTATTCACCG GACATATTGG TTGTGGTAAA TCTACCGAAC TCTGGCTACT CAAACAACTA
TTAGAAGCAG AAGGCTTTCA CGTCGTCTAT TTTGAGTCAG ACAAAGACCT AGAAATGGGA
GATATAGACG TCAGTGATAT TCTTCTAACT ATAGCTCGAC AAGTCATCGA AAGTTTGAAG
ACCCGGGAGA AGCTCAATTT AGGTGAACCC ACAAGATTCA AACGTCAAAT TGAGGGAGCT
ATGAGACTAT TGCAGACAGA AATAGAAATA TCTCCTGGAG TTGAGCTTTC TTTTGGAATT
GCCAAAATAA CTGCTCAGGC CAAAGCTAGT CCCCAACTCC GCAGCAAACT CAGAGACTAT
CTCGGGCCTC GTAGCAATGG AATTATTGAA ACAATAAACC AAGAGTTACT CGAACCAGCT
CACGAAAAGC TGAAGCAGCG CGACAAAAAG GGATTAGTAG TTATAGTTGA CAACCTTGAT
AAAGTTGATA GTGCCCCAAA ACCTTGGGGG CGATCTCAAC CAGAATATCT ATTTGTTGAT
CGCGGCGAAC AACTAGCAAG TCTCCATTGT CATGTAATTT ATACTCTACC TCTAGCACTG
CGATTTTCCA ATGACTATAA TAGATTAACT CAACGCTTCA AAACCGACCC CCAAGTCTTG
CCAATGGTTT CTATGCAGTT GCGGGATGGT AAGGAATTTG GAGAAGGAAT GGCAAAACTC
AGGCAGTTAG TTTTAGCAAG AGCATTTCCC CATTTGGGAG AACAACAACG TCTAGAGAAA
ATAACTGAAA TTTTTGACAG TACCGAAACT TTAGATCATC TATGTAAAAT GAGTGGGGGT
CATGTGAGAA ATATATTGCG GATACTCAAT GAGGCTATTA AAAAGCAAAA AGGGTTGCCG
ATATCCAGTG AAAACCTGAA CAAGGTAATT CAAAATTTTC GCAATGAACG TACTCTGGCA
GTAGATGATC AAGAGTGGGA GTTGTTGCGC CAAGTAGCAC AAACTAAAAA AGTTACAGGT
GATGATGGAT ATCAAAAGTT GATTCGGAGT ATGTTTGTCT ATGAATACCG AGATGATGAA
GGATCTTGGT TTGATATTAA TCCTATATTG AAAGATGCAG TGGAATTGAA AAAATGA
 
Protein sequence
MSVNVKAFFQ ATNPGKALFK DNQEIEEKYY IDFSSVRGGK IIEDLKDNIA MWFPDEPTCQ 
LFTGHIGCGK STELWLLKQL LEAEGFHVVY FESDKDLEMG DIDVSDILLT IARQVIESLK
TREKLNLGEP TRFKRQIEGA MRLLQTEIEI SPGVELSFGI AKITAQAKAS PQLRSKLRDY
LGPRSNGIIE TINQELLEPA HEKLKQRDKK GLVVIVDNLD KVDSAPKPWG RSQPEYLFVD
RGEQLASLHC HVIYTLPLAL RFSNDYNRLT QRFKTDPQVL PMVSMQLRDG KEFGEGMAKL
RQLVLARAFP HLGEQQRLEK ITEIFDSTET LDHLCKMSGG HVRNILRILN EAIKKQKGLP
ISSENLNKVI QNFRNERTLA VDDQEWELLR QVAQTKKVTG DDGYQKLIRS MFVYEYRDDE
GSWFDINPIL KDAVELKK