Gene Tery_4101 details


Gene Information

Locus tag: Tery_4101
Symbol:
ID: 4245615
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6325134
End bp: 6326900
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 34%
IMG OID: 638109002
Product: poly-gamma-glutamate biosynthesis protein
Protein accession: YP_723582
Protein GI: 113477521
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation)
TIGRFAM ID:
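
The coordinates and lengths in this record agree with one another if the start and end positions are 1-based and inclusive and the gene length counts a terminal stop codon. Both conditions are assumptions, since the record does not state them. A minimal Python sketch of that arithmetic, with illustrative variable names:

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 6325134
end_bp = 6326900

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence ends with one stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1767
print(protein_length_aa)  # 588
```

Under these assumptions the results match the Gene Length and Protein Length fields.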


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.422401
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.368838
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTACA TACCTAATTT GACTCAAAAA TCTCTTTTTG AATTAGCAAG TTCCGGTGAT 
TTTCAGGCAA TTAGTCAATG GATTAATAAA AAACTTAAAC CTCAAGGAAT TTCAGCTCGT
ATAGCTAAAG AAAATACTGG ATATCTAGAA GTTTTAGTAG AGTTTCAGAC TCAACCTCCT
GTAGATAGAT TAATTAAGTT TATCTGTTAT CAACTTTCTC AACTTAACTA TCCTACACTA
GAAAAAGTAA AAATTGTGGG GCGTTTAAGT GGTTCACCTA ATATACTATG GAAACACTCT
GTCAGAATTA ATTCTCATGC AAAAAAAAAT TTCAATAAGC AATCTAATTT TGTAAACAAA
AGTGATAATT TGCAATTTCA AACATTTCGT TATTTGATCT TACTCAGTTC AGCGGTTGCA
GCTTTCATTA TAGGTATTTT AGTAAGTTAT TATAGTGTTT TGGTACGAAA TTCAACCTCA
GGTCAATGGG TAGAAACTGC TATAGAAAAG GTGAGAGTTG TGGAGCATAG AAAGGTACAA
AACTCCCAGG ATCCTATGGT GACTTTAATG TTTGGTGGAG ATGTTAATTT ATCTAACCAA
GTTTCTAATT TAGTAAAGAG AGATTATAAG TTACCTTTTG CTAAAATGAA TGAGTATAGG
GCTGCAGACT TATCAATAGT TAACCTGGAA AGTCCTTTGA CCCGTTCTAC TCTCAACAGT
AGAACTCAGC AACAAAAATC AACGGTAAAT CCTAGTTATG TTAAGGCATT AACCTCAGGA
GGAGTTGATC TGGTAAATTT AGCTAATGAC CATACTTTGG GTTATGAGCA AAAAAGTTTG
TTAGAGACAA TAGAAACTTT AGAGAATGCG GGTATTCATT CTTTAGGAGC GGGCAAAACA
GAAGAAGAGG CTAGAAGGCC AAAAATTTTT GAAGTTAAAG GCCAAAAGAT TGCATATCTC
AATTACTATG ATACAGATAT TCAACCAACT ACTGAATCAG TATATGTAAA TAGTCGGAAT
AAGGATAGGC TCTCTTCAGA TATTCAAATT TTGAAGAAGC AGGTAGACTG GATAATTGTT
AATTATCATT GGGGGGTTCA ACTCTCAGAA TATCCTGGAG ATTGGCAGAT GAATATAGCG
AGGATGACAA TTGACCAAGG TGCTGATTTG GTAGTAGGAC ATCATCCTAA AGTATTGCAG
GGGGCAGAAA TTTATCGGGG ACGACCTATT ATATATTCTT TGGGAAATTT TATTTTTGGA
GACACTTCTA ACAAAGAGAG TGATTATGAC ACAGCAGTTT TGAAGGTATC TTTAAAACCA
GGAAAAATGA AGATTGAGTT TTTGCCTGTA GTGGTTAGTA AGTACCAACC CCACATTGTC
AAAGGTGAAA AAGGTAAAGA AATTCTTAAA CACATTGCTC AAATTTCTAG TATTTTTCAC
CAGCCAATGA GAACTCCTAT AATAATAAAT ACGATAAATG ATGATTTTAA TTTTGTTGGT
ATTGACTCTT TTCCTAGGGA AGAAAATTCT AAAACTTTCT CAACTCCAAT TTTACCTGAG
TTACCTCTAA AATCTCCACA AGCTGATCCA AATCCTACAA GCTCTTCTCA TAATAATTCA
GAGCAAGAAG CAAGTAATAA TAATAATAGC TTTTCTTTAC CACCAATATT AAGTCCTGCA
CCCACTCCTA AAGAAAGAAT AGATCCTTTC ATTAAAAAGC CATTTATCAA AGAACCTTTT
ATTGAATTGC CTCGTTTACA AATTTAA
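
The GC content field of this record (34%) could be checked against the gene sequence with a small helper like the one below. This is a sketch, not the database's own method. The function name `gc_content` is hypothetical, and it assumes GC content means the fraction of G and C bases over the full sequence length.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to group bases for display.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# The first ten bases of the gene sequence, used as a small example.
print(f"{gc_content('ATGAATTACA'):.0%}")  # 20%
```

Passing the complete gene sequence as one string would give the value to compare with the reported percentage.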
 
Protein sequence
MNYIPNLTQK SLFELASSGD FQAISQWINK KLKPQGISAR IAKENTGYLE VLVEFQTQPP 
VDRLIKFICY QLSQLNYPTL EKVKIVGRLS GSPNILWKHS VRINSHAKKN FNKQSNFVNK
SDNLQFQTFR YLILLSSAVA AFIIGILVSY YSVLVRNSTS GQWVETAIEK VRVVEHRKVQ
NSQDPMVTLM FGGDVNLSNQ VSNLVKRDYK LPFAKMNEYR AADLSIVNLE SPLTRSTLNS
RTQQQKSTVN PSYVKALTSG GVDLVNLAND HTLGYEQKSL LETIETLENA GIHSLGAGKT
EEEARRPKIF EVKGQKIAYL NYYDTDIQPT TESVYVNSRN KDRLSSDIQI LKKQVDWIIV
NYHWGVQLSE YPGDWQMNIA RMTIDQGADL VVGHHPKVLQ GAEIYRGRPI IYSLGNFIFG
DTSNKESDYD TAVLKVSLKP GKMKIEFLPV VVSKYQPHIV KGEKGKEILK HIAQISSIFH
QPMRTPIIIN TINDDFNFVG IDSFPREENS KTFSTPILPE LPLKSPQADP NPTSSSHNNS
EQEASNNNNS FSLPPILSPA PTPKERIDPF IKKPFIKEPF IELPRLQI
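
The protein sequence corresponds to the gene sequence read in codons under translation table 11, which assigns the same amino acids to codons as the standard genetic code. The sketch below illustrates this on the first 30 bases. The names `CODON_TABLE` and `translate` are illustrative, and the code assumes the sequence is in frame and contains only A, C, G and T. It does not handle the alternative start codons that table 11 allows, which is not an issue here because the gene begins with ATG.

```python
from itertools import product

# Amino acids for the 64 codons in the order TTT, TTC, TTA, TTG, TCT, ...
# (bases ordered T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame nucleotide sequence, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group bases for display.
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAATTACA TACCTAATTT GACTCAAAAA"))  # MNYIPNLTQK
```

The output matches the first ten residues of the protein sequence.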