Gene Tery_3371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3371 
Symbol 
ID4243466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5168831 
End bp5170021 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content33% 
IMG OID638108356 
Productglycosyl transferase family protein 
Protein accessionYP_722946 
Protein GI113476885 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.547122 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTAA CTTTTTGCAT AATTGTTAAA AATGAGGAAA AAAATTTGCC ACGATGTTTG 
GCAAGCGTGA AAAATGTAGT AGATGAAATA GTTGTTTTGG ACACGGGTTC AACGGACCGA
ACCCCAGAAA TAGCTCAAGA ATTTGGTGCA AAAGTACATT ATTTTGAATG GTGTAATGAT
TTTGCAGCAG CTAGAAATGA ATCTTTGAAA TACGTTACAG GTGACTGGGT TTTAGTATTA
GATGCTGATG AATATCTCTC ACCAAAAATA GCTCCACATA TTAGACAAGC TATTCAGAGC
GATCGCTATA TTTTAATTAA TTTAATTAGA GAAGAAATAG GAGCACAACA ATCTCCATAT
TCTTTAGTTT CTCGTCTCTT TAGAAATCAT CCTGGGATAA AATTTTCTCG ACCTTATCAT
GCCATAGTTG ATGATAGTAT TTCTGAGATT TTAGCAAAAG AATCTAACTG GCAAATAGGT
TCTTTGTCAG AAATAGCAAT TTTACATGAG GGTTATCAAA AGGGAGAAAT TACTTCTAAA
AATAAACTCG AAAGAGCCAA GGCAGCGATG GAAAGTTTTA TTATAGACTA TCCTAATGAT
GCTTATGTTT GTAGTAAATT GGGAGCTTTA TATTTGGAAG TTGATGAAAG AGAAAAAGCG
ATGGAATTAT TGTTTCGAGG TTTGCAAGAT TCGGAGGTAG ATAAAACAGT TTTGTATGAG
CTACATTACC ACTTAGGAAT TGCCTATAGA CAAAAGCAAG AAAAAGAAAT GGCAAAGCGT
CATTATCAAA TAGCAACAGA GTTAAATATT TTGCCTAAGT TGAAACTAGG AGCATACAAT
AATTTAGGTA GTTTGTTGAA AGAAGAAGGA AATTTAGAGC AGGCAAAAAC TAACTATGAA
ATTGCTTTGC AGATAGACTC TAATTTTACA GTTGGTCATT ACAACAGGGG TATGGTTTTG
AAAGAAATGG GTTGGTTTAC TGAAGCGATC GCCTCTTATC AAAAAGCAAT TCAACTTGAT
GCTAACTATG CAGAAGCTTA TCAAAATTTA GGAGTTGTAT TGTTGAAAGT AGGTCAAGTA
CCAGAAAGTT TAGAAGCATT TGGGAAAGCT ATTAGTTTGC ATGAAAAATA TAACCCTATT
GAAGCTAAAA GAATTAGGCA AGGTTTAGAA TCAATGGGTT TTATGTTTTA G
 
Protein sequence
MNLTFCIIVK NEEKNLPRCL ASVKNVVDEI VVLDTGSTDR TPEIAQEFGA KVHYFEWCND 
FAAARNESLK YVTGDWVLVL DADEYLSPKI APHIRQAIQS DRYILINLIR EEIGAQQSPY
SLVSRLFRNH PGIKFSRPYH AIVDDSISEI LAKESNWQIG SLSEIAILHE GYQKGEITSK
NKLERAKAAM ESFIIDYPND AYVCSKLGAL YLEVDEREKA MELLFRGLQD SEVDKTVLYE
LHYHLGIAYR QKQEKEMAKR HYQIATELNI LPKLKLGAYN NLGSLLKEEG NLEQAKTNYE
IALQIDSNFT VGHYNRGMVL KEMGWFTEAI ASYQKAIQLD ANYAEAYQNL GVVLLKVGQV
PESLEAFGKA ISLHEKYNPI EAKRIRQGLE SMGFMF