Gene Tery_3331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3331 
Symbol 
ID4243502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5110069 
End bp5111208 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content41% 
IMG OID638108316 
Productvon Willebrand factor, type A 
Protein accessionYP_722907 
Protein GI113476846 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAAGG ATGATGAAAT TACTCTCCGC ATTAAAGTTA CTGAAGATGG AGAGCGCCCG 
GCAATGTTTC TTGAAGAACA AGATTTTCAG GCAATTGTCG ATAACGACCT GGTAGATATT
ATATCATGGA AAAATCCTGA AGAAAGCGTT CCTCCTCCAG CATGGATTAT AGTTTTGCTC
GACTTTAGCG GGAGTATGAA GGAAAAAGAT AGTAGTGGAA CTACAAAACT TGAGGGAGCG
ATCAAGGCTA CCCGAGAATT TCTGGAAACA ACGAGTGCGC GTGGTAGCAA TACCAGGGTA
GCAATTTTTC CTTTTGGGGA AGGTGGAGGA AGATGTAATA GTTATAAGGT GAGGAGGGAG
AATATCAAAT CTAGATTTTT TCCTGCGGAC GATTTCAAAC ACAAAAACTT ACTAGATAAT
CTGGCCAAGA AAACTCCTTG CGCATCCACT AATATTTATG ACCCTCTCAA AGAAGCTATC
CGCTTGCTCA GCGACCAAGA AGATACGGAC TTTTATGTTC CTGAAGATTC CATAGAACCA
GAACCACGAC TATCAGTGAT TTTGCTGTCT GATGGGTACC ATAACAAAAA ATACGAAAAC
CGGGATTTTA GGAGATTAAT TGCTTTACTT GAGCGTCATG ATCATATTGT GGTTCACACT
TTGGGATACG GTTTAACACA AGAACAGCTT GGAAAAAAGT ATAATCTGGG ACGACCAGCA
ACCCGCGCTG ATGTAAATCA AAAATATGTT CCAGCAGAGG AATTTGTTGA CCAGATGCGT
TTGCAGGAAA TAGCAGAAGT AACAGGAGGA ATCTCAGAAT TCTCTGGTGA TGCTGATGAT
ATTGCCGAGG CGTTGCAGTT ATTTCTCAAT TCTCTGTTAG GAGAATACGA GATTATTTAT
CGGGAACCCA ACCCGGTACG CGCCTCTAAA CATGAGGTTT TTGTGACAGC TACAGTTAAT
GGACAGGATG TAGATACTAC ACAGAAATCT TATACTATTA ATGCTTTTGG GCGATCGCTA
CCAGCAAAAA CCAGATTAAA AATGACGGGT TTAGTGTTGT TGCTATTGGG GTTAGGAGGA
GTTATACCTT TCTGGAATTG GAGTCAGAGA CTCAAGCAAG AAACTGAAGA AGATTTTTAA
 
Protein sequence
MVKDDEITLR IKVTEDGERP AMFLEEQDFQ AIVDNDLVDI ISWKNPEESV PPPAWIIVLL 
DFSGSMKEKD SSGTTKLEGA IKATREFLET TSARGSNTRV AIFPFGEGGG RCNSYKVRRE
NIKSRFFPAD DFKHKNLLDN LAKKTPCAST NIYDPLKEAI RLLSDQEDTD FYVPEDSIEP
EPRLSVILLS DGYHNKKYEN RDFRRLIALL ERHDHIVVHT LGYGLTQEQL GKKYNLGRPA
TRADVNQKYV PAEEFVDQMR LQEIAEVTGG ISEFSGDADD IAEALQLFLN SLLGEYEIIY
REPNPVRASK HEVFVTATVN GQDVDTTQKS YTINAFGRSL PAKTRLKMTG LVLLLLGLGG
VIPFWNWSQR LKQETEEDF