Gene Tery_3334 details


Gene Information

Locus tag: Tery_3334
Symbol:
ID: 4243505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5113872
End bp: 5115197
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 41%
IMG OID: 638108319
Product: von Willebrand factor, type A
Protein accession: YP_722910
Protein GI: 113476849
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1240] Mg-chelatase subunit ChlD
TIGRFAM ID:
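
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon. The record does not state either convention, and the helper names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5113872
END_BP = 5115197
GENE_LENGTH_BP = 1326
PROTEIN_LENGTH_AA = 441


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: the span is 1326 bp, which corresponds to 442 codons, or 441 residues once the stop codon is excluded.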


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0867289
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGACG TAACCATTAC CCCCCATCGA GAATTTCTAG CGGCTGATAC CCCTGGACAA 
AAGCTGTTTG TCATGTTAAA ATTACGTCCA AACGCAATTG TTTCAGCAAG TCGTCCTTCA
ACAACCTTCA CCTTTGTTAT TGATACCAGT GGTTCAATGT ATGATGATAG TGAAGTGGGG
AGGCCGAAAA TTGATATTGT TGTTGAAGCT CTCGAACGCT TAGTTACTGA TATACAAGCA
GATCCTCGCG ATCGAATTGC CCTAGTACAA TTTGATGATT CAGCATCAGT TTTGTTGCCC
TTGACTGCTG CCACAGATAC TGTTACTCTC CAAAATGCCA TCTCCAAATT ACGAAGTTTT
AGTGGTGGAA CAAGGATGGC TTTGGGAATA GAAAAATCCC TGAATTTATT GAAAGACTCT
GTTCTCAGTA GTCGTCGCAC TCTCATTTTT ACTGATGGAC AGACAATAGA TGAAATTGAC
TGTCGAGAAC TAGCGGTACA ATTTGCCCAA GCTGGAATTC CTATTACTGC TCTCGGTGTT
GGTGACTACA ATGAAGACTT GTTAGTCTAT TTGAGTGATC ACACTGGGGG TCGCGTTTTT
AATGTTGTGG AACAAGCCAG TAATACTGGA ACCACAGATA TAGCAATTTC TGAGCTGCCA
CAGACAATTT TTCAAGAAGT ACAACAGGCT CAAGCTGAGG TCATTAATAA CCTCAAGCTT
AGTGTTCGTA CTGTCAAAGG GGTTAATTTA CAAAGACTTA GCCGTGTTTA TCCAGACCGC
GCTGATATTC CTGTTACTCA AGAACCTTAT CTCATCGGCA GCGCTCTCGC TAATGACGAT
ACTATTTTTA TTCTTGATTT TGATATTGAT AGCAGAGCTC AATCACGGGT TCGTATTGCT
CAATTAGGTT TAACTTACGA CATTCCCGGT CAGCAGCGAC GAGGAGAACT ACCCCCTCAA
AATCTTGTTA TTCAGTTGGT TGCCGGAAAA GGTGGAATTG CCCAAACAAA TCCGGAAGTC
ATGGGATATG TACAACAGTG TAACATTGGT CAATTAGTCG ATCATGCAGC AGCAGTAGCT
GATAGCAACC CTGATGAAGC AGCAAAACTT TTGGAAACAG CAAAACGAGT AACTGTCAAA
ATTGGAAATG AAGCCATGTT AAAAACTCTC AATCTTGGTA TTGAAGAAGT ACGCAAAACC
CGTAAACTGT CTTCAGGAAC CCGCAAGACT GTAAAAATGG GTGCTAAGGG TAAAACTGTA
AAAATGAGCG ATAGTCCTAA TGATCAACTT TCAGAAGAAC AAATCCGTAA TATGACAGGA
ACCTAG
 
Protein sequence
MLDVTITPHR EFLAADTPGQ KLFVMLKLRP NAIVSASRPS TTFTFVIDTS GSMYDDSEVG 
RPKIDIVVEA LERLVTDIQA DPRDRIALVQ FDDSASVLLP LTAATDTVTL QNAISKLRSF
SGGTRMALGI EKSLNLLKDS VLSSRRTLIF TDGQTIDEID CRELAVQFAQ AGIPITALGV
GDYNEDLLVY LSDHTGGRVF NVVEQASNTG TTDIAISELP QTIFQEVQQA QAEVINNLKL
SVRTVKGVNL QRLSRVYPDR ADIPVTQEPY LIGSALANDD TIFILDFDID SRAQSRVRIA
QLGLTYDIPG QQRRGELPPQ NLVIQLVAGK GGIAQTNPEV MGYVQQCNIG QLVDHAAAVA
DSNPDEAAKL LETAKRVTVK IGNEAMLKTL NLGIEEVRKT RKLSSGTRKT VKMGAKGKTV
KMSDSPNDQL SEEQIRNMTG T
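
The gene and protein sequences can be related with a short translation routine. This is a minimal sketch, not the method used by the database: it builds the amino acid assignments of NCBI translation table 11 (which match the standard code for all 64 codons), ignores the spaces used to group the sequence into blocks of ten, and stops at the first stop codon. The function names are hypothetical, and only the first 60 bases of the gene sequence are reproduced here.

```python
from itertools import product

# Amino acids for codons in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence):
    """Remove whitespace (block separators, line breaks) and upper-case."""
    return "".join(sequence.split()).upper()


def translate(dna):
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence in this record.
    first_line = (
        "ATGCTTGACG TAACCATTAC CCCCCATCGA "
        "GAATTTCTAG CGGCTGATAC CCCTGGACAA"
    )
    print(translate(first_line))
```

Applied to the first line of the gene sequence, the routine yields MLDVTITPHREFLAADTPGQ, which matches the first 20 residues of the protein sequence. Passing the full gene sequence to gc_percent would give a value to compare with the GC content field.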