Gene Tery_0156 details


Gene Information

Locus tag: Tery_0156
Symbol:
ID: 4241749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 230297
End bp: 232696
Gene Length: 2400 bp
Protein Length: 799 aa
Translation table: 11
GC content: 34%
IMG OID: 638105504
Product: transglutaminase-like
Protein accession: YP_720123
Protein GI: 113474062
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
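
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(230297, 232696)
print(length, protein_length_aa(length))  # 2400 799
```

Both values agree with the Gene Length and Protein Length fields of the record.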


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.973257
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0989488
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTAATT ATTTTTCTGG ATGGAAAGTG CGAAATCTAC CTATATTAGG ACTTCTGTGG 
AAACAAGTAC AAGCAGCACC TAAACCTATT GCTGAAGATT CTATTCTATT ACGAGTAATG
GTACAGGCAT TGGTAATAGT AGGTATTATA GCAATTGACG TGGCTGCTGG GGTAGCAACA
GATGAACTAC CTTTAACTGT TTGGGCAGGA CCACTAAGTA TTATAGGAGC AATTTGGAGT
TGGTATAATC GCTACAAAAG AAATGTGCTA GCTAAATTCT GCTTGGCGAT CGGAATGTTA
TTAATTTTTG CTGCTTTTTT GAGTAGATTA TTTGGTGAGC TAAATGATAC ACGACTGGTA
TTGGCTGAAT TATTAATTCA ACTTCAAGTC TTACATAGCT TCGATCTACC TAGGCGAAAA
GACTTAGGTT ATTCAATGGT TATAGGGCTA ATTTTGTTGG GGGTTGCAGG CACTATTAGC
CAAACTTTAA CTTATGGACC TGTATTAATA GTGTTTACTA TATTTGCTAT TCCTTGTCTT
GTTTTAGATT ACCGTTCTAG GTTAGGTTTG TACTCAAGAT TGTCCAGTAA TAAATTAGAG
CTTCGAAGGC AAAAACAGCA AAAAAAAATA TCTCTGTTAA ATGCTACTTT GTCTCCCAAG
CGATTAACAA TATTATTGTT AGTAATTCTT GGTTTGGGCA TGACAATTTT TGCTTTTTTA
CCAAGGTTAC CTGGATATCA AATACGAACT TTTCCTGTGA GCAGTACGAT TAATTTTGAT
TTAGAAAATT TCGATGGGCG AACAATTACA AATCCTGGTT ATATTAGTCA AGGTAGGGGA
AATAGAGATG GTTCTGGAAA TAGTAATGGA ACTAACCAAG AATCAGGTCC TGGAAGTTTA
AATGATACTT ATTATTACGG GTTTAATACT AAAATTAACC AAAATTTGCG TGGGCAATTA
GTGCCTCAAG TTGTGATGCG AGTTAGGTCG CAAGCTGAAG GTTTTTGGAG AGTTTTAGCA
TTTGATAAAT ATACTGGTGA AGGTTGGGAA ATTTCTCGAA ATGATAAGAC TATTACTTTA
GATCGTAGCA GTTTTTCTTA TCAATTTTCT GTGGGTTTAC AGAGAAAGAA TTATAAAAAT
GAAACTAAAG AAGTCATTCA AAATTATACA GTAGTTTCTC CGCTACCAAA TTTAATTCCA
GCTTTATATC AGGCCAAAGA AGTATATTTT CCTACTAAAA AAGTAGCCAT TGACCTTGAT
GGAAATTTAC GTTCTCCTCT TGAACTTAGA GAAGGTTTAA CCTATACAGT AATTTCTAAT
GTTCCTTATC GAAATCGTAG TATTTTGAGT CAAGCGGGAA CAAATTATCC AGAAAGTATT
AGTGAATATT ATTTACAGAT TCCTTCTGAA ATACTAGAAA AAGTCAGACA AAAAACAGAA
GATATTTTAG CAAATAAAAA TAAAGTTGCT CAAGAAATTA AACCTATTAT TTCTCCTTAC
GAAAAAGCTC TTTATTTAGC TCAATACCTT AAGCAAAACC CAAATTATAA ACTTCAAAAA
GCACCACCAT TTTTAGCAGA AGATGAAGAT TTAGTCTCAG CTTTTTTATT TGGTTACCAA
AATAGCCGGG AAGGGGAAAA AATTACGGGT GGTTATGGCG ATCATTTTTC TACTGTTTTG
ACAATAATGT TACGTTCTAT TGGAATTCCT GCGAGATTAG TTGCCGGTTA TGCTCCTGGT
GAATTTAATC CTTTTACTGG TTTATATGTG GTCAAAAATA CAGATGCCTA TATGATGACG
GAGGTATATT TTCCGGGGTA TGGGTGGTTT GCTTTTAATC CTATTCCGGG AATGCCATTA
ATTCCTCCTT CCATTGAAAA AAATCAAAGT TTTACTGGGT TAATGGTTTT TTGGAAGTGG
GTAGCTGGTT GGTTACCTTC TCCTGTAACT GGTTTTTTGG AGAGGTTTAT TGGTGGAATA
TTCCTTTGGA TATTGGGAGT TATTGCTTGG TTTTTTGCTT TATTTTCTCA AGGTTGGCAA
GGCATATTTA CTGGTTTAAT ATTATTTACT GCTTTAGGTT TTTTGGGTTG GTTATTATAT
ATTTTTTGGC AAAAATGGCA TTATGGTCGT TGGTTAGGAA GGTTATATCC TATGGAGAGT
ATCTATCAAC AAATGTTGAA TTTGATGGCG AGTAAGGGAT TAGTTAAACA TGGTTTTCAG
ACCCCTTTTG AATATACTAA GGCTATTAAG GGATATAACT CTAGGAATGA AGCAGAAATA
ATAGAAGAAA TTTCGCAAGC TTATGTAGAA TGGCGTTATG GTGGTAAGGA GGTTAATTTG
AATAGATTAA AAGGTTTACT GCGGAATTTA AAGAGTAGTA TCAGGCGGAA ATTAAATTAA
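
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before it can be analysed. As a minimal sketch, the helpers below (hypothetical names, not from the source) strip the whitespace and compute the fraction of G and C bases, which can then be compared with the GC content field of the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "ATGTCTAATT ATTTTTCTGG ATGGAAAGTG CGAAATCTAC CTATATTAGG ACTTCTGTGG"
bases = clean_sequence(first_line)
print(len(bases))  # 60
print(f"{gc_fraction(bases):.0%}")
```

Pasting the full gene sequence into `clean_sequence` should give a string of 2400 bases if the record is intact.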
 
Protein sequence
MSNYFSGWKV RNLPILGLLW KQVQAAPKPI AEDSILLRVM VQALVIVGII AIDVAAGVAT 
DELPLTVWAG PLSIIGAIWS WYNRYKRNVL AKFCLAIGML LIFAAFLSRL FGELNDTRLV
LAELLIQLQV LHSFDLPRRK DLGYSMVIGL ILLGVAGTIS QTLTYGPVLI VFTIFAIPCL
VLDYRSRLGL YSRLSSNKLE LRRQKQQKKI SLLNATLSPK RLTILLLVIL GLGMTIFAFL
PRLPGYQIRT FPVSSTINFD LENFDGRTIT NPGYISQGRG NRDGSGNSNG TNQESGPGSL
NDTYYYGFNT KINQNLRGQL VPQVVMRVRS QAEGFWRVLA FDKYTGEGWE ISRNDKTITL
DRSSFSYQFS VGLQRKNYKN ETKEVIQNYT VVSPLPNLIP ALYQAKEVYF PTKKVAIDLD
GNLRSPLELR EGLTYTVISN VPYRNRSILS QAGTNYPESI SEYYLQIPSE ILEKVRQKTE
DILANKNKVA QEIKPIISPY EKALYLAQYL KQNPNYKLQK APPFLAEDED LVSAFLFGYQ
NSREGEKITG GYGDHFSTVL TIMLRSIGIP ARLVAGYAPG EFNPFTGLYV VKNTDAYMMT
EVYFPGYGWF AFNPIPGMPL IPPSIEKNQS FTGLMVFWKW VAGWLPSPVT GFLERFIGGI
FLWILGVIAW FFALFSQGWQ GIFTGLILFT ALGFLGWLLY IFWQKWHYGR WLGRLYPMES
IYQQMLNLMA SKGLVKHGFQ TPFEYTKAIK GYNSRNEAEI IEEISQAYVE WRYGGKEVNL
NRLKGLLRNL KSSIRRKLN
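
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an assumption-laden illustration, not the database's own method: it uses the standard codon assignments, which translation table 11 shares for elongation codons, and it does not handle the alternative start codons that table 11 allows (not needed here, since this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest) paired with
# the amino acids of the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw_seq.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown in the record.
print(translate("ATGTCTAATT ATTTTTCTGG ATGGAAAGTG"))  # MSNYFSGWKV
```

The output matches the first ten residues of the protein sequence, and the last four codons of the gene (AAA TTA AAT TAA) translate to KLN followed by a stop, matching its final residues.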