Gene Tery_4217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4217 
Symbolddl 
ID4245869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6502711 
End bp6503811 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content38% 
IMG OID638109113 
ProductD-alanyl-alanine synthetase A 
Protein accessionYP_723691 
Protein GI113477630 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes 
TIGRFAM ID[TIGR01205] D-alanine--D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.494789 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAT TACGAGTAGG TTTATTATTC GGTGGTTGCT CTGGCGAACA TGAAGTATCT 
ATTCGCTCTG CCAAAGTGAT CGCTACTGCT TTAACTAAGG TTCAAAATAA AGAGAAATAT
GAACTAATAC CAATTTACAT TCAAAAAAAT GGTCTTTGGC AACCAAGTGA TTTTTCTCAA
AAAGTACTAA ACTCTGATCA CCCATCATTG TTACAATTGA CAAATGAAAA TAATCATGAC
TTAAATACAT CCTTAACTCA ACAAAGCTCT TCTCCTCTCT GGCAAATTCC TCCTCAAGCA
GCACAGGTAG ATGTTTGGTT TCCCATTCTT CATGGTCCGA ATGGGGAAGA TGGCACCATA
CAAGGTTTAC TAAAGTTAAT GCAGGTTCCT TTTGTGGGAT CTGGTGTTTT GGGTTCAGCA
ATGGGCATGG ATAAAATAGC GATGAAAATA GCTTTTGACC ATGCAGGTTT ACCTCAAGTA
AAGTATCAGG TAGTAACACG ATCGCAAATT TGGTCGAATT CTTGTGTATT TCCCAAATTG
TGTGATGATA TTGAAACAAC TTTAGAATAT CCTTGCTTTG TAAAACCTGC AAACCTAGGA
TCATCCGTAG GTATTGCCAA GGTGCGATCT CGTTCAGAAT TAGAAACCGC CTTGGATAAT
GCTGCTAGTT ATGACCGTCG AATTATCGTT GAAGCTGGTG TAGAAGCAAA AGAGTTAGAA
TGTGCGGTTT TGGGGAATGA TATGCCAAAA GCTTCTATAG TTGGTGAGAT TACTTATAAT
AGTGATTTTT ATGATTATGA AACTAAGTAT ACAGAGGGAA AAGCAGATTT ACATATTCCG
GCTCGGGTGA GTGAGGCGAT CGCCACTAAA ATCAAAGAAA TGGCAACCCA GGCATTTTTA
GCTGTGGATG CAGCAGGTTT AGCAAGAGTA GACTTTTTCT ACGTGGAAAA AACCGGAGAA
ATTTTGATCA ATGAAATTAA CACAATGCCT GGTTTTACTT CATCCAGTAT GTATCCTATG
CTTTGGGAAG CAAGTGGAAT TCCTTTTTCA GAGTTAGTAG ATACTTTAAT TCAGTTAGCT
TTAGAAAGAC ATTCTAAGTA A
 
Protein sequence
MTKLRVGLLF GGCSGEHEVS IRSAKVIATA LTKVQNKEKY ELIPIYIQKN GLWQPSDFSQ 
KVLNSDHPSL LQLTNENNHD LNTSLTQQSS SPLWQIPPQA AQVDVWFPIL HGPNGEDGTI
QGLLKLMQVP FVGSGVLGSA MGMDKIAMKI AFDHAGLPQV KYQVVTRSQI WSNSCVFPKL
CDDIETTLEY PCFVKPANLG SSVGIAKVRS RSELETALDN AASYDRRIIV EAGVEAKELE
CAVLGNDMPK ASIVGEITYN SDFYDYETKY TEGKADLHIP ARVSEAIATK IKEMATQAFL
AVDAAGLARV DFFYVEKTGE ILINEINTMP GFTSSSMYPM LWEASGIPFS ELVDTLIQLA
LERHSK