Gene Tery_2700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2700 
Symbol 
ID4244963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4176775 
End bp4178328 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content34% 
IMG OID638107763 
Productpentapeptide repeat-containing protein 
Protein accessionYP_722362 
Protein GI113476301 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0664303 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAATCG AAGAACTTCA AGAAAAATAT GCAGCCGGTG AAAGAAACTT TACTAATCTC 
AATCTTTTTG AGGCCAATCT CGCTGGAATG AATCTTAGTG GTGTAAATCT TAGTGGAGTT
AACCTTAGTA TTGCCAACCT AAGTGGTACA AATTTAACTA ATTCTAATCT GAGTAAAGCT
AAACTTAATA TTACTAAACT TAATGGAGCT CATCTGAGTG GTACAAATTT AAATGGTGCT
GATTTGAATG TAGCCAACTT AGTTCTAGTT AATTTGAAAA GAGCTCAATT AATTGGTGCT
AAATTGATTA GGGCAGAATT GATTAGAGCG CAACTGAGTG GGGCAAATTT TTCCTTAGCT
GATTTAAGTG GGGCAAGTCT TGTTGAAGCC ACACTCCGAA AAACTAATCT TAGTGGTGCC
AACTTAAAGG GAGCTAACTT ACGTTTTGCT TTTATTACAG AATCTAATTT AATAGAGGCC
AATTTGGAAG GAGCAGACCT GAGTGGTGCA GACCTCAGCG GTTCAGACCT TAGTGGGGCT
GAATTGAGAA AAAGTAATTT GACTGGTGCT AATTTGAATG GTGCTAATTT AAGCGGGGCT
AATCTACGTT GGGCTGTTTT GACTGGCGCT CAAATGAACT GTGTTAATCT TAGTGATGCA
AAACTGAGTG GTGCTGACTT AAGTAGAGCC AACTTGTTCC AAGCTAATCT TTTAAATGCT
AGCCTAGTTC ATGTTAATTT ATCTAGTGCC TGTTTAATTG AAGTAGATTG GATTGGAGCA
GATTTAACAG GAGCAAATTT AACCGGGGCA AAAATTTTTG GTACCTCTCG TCTAGGGATT
AAAACAGAAG CTGTCACCTG TGATTGGCTT GATTTGAGCC CTAATGGAGA TCAACCTCAA
GTTTATACTC TCAATACTCT GAAAATTAAA AGTTTCTTTC ACGAGACTCT TCCTAAAGTG
AAGATAATTG TTGATACTCG CTTAGATATT TCTAGCCATT ATGCTCTAGC TGCTACATAT
TATCAAATTA GCAAGTATTT ACCAGATATA AATCAACCTC CTTGTATAGA AGTTAACTCT
TGTAGAACAA TTATTTCTTT TTCAATGAAC AGCGATGAAA AAATATTTGC TGTTGCTTAT
ATCACTGTTC TACCTTTTCT TGATGGTTAT AAAACTCAAG AAAATATTAT CAAAATCATG
AAGATACTTA ACTCTCATGA TTTCAATTCT TTAACTCCTA TAGTAGCTAA AATAATTCGG
CAGTTAAGTA CAGTTTTAAA TAAATCAATT CAACAAGTAA ATAAAATTAA TAAAGACAAA
GTTTTGTTGA AATTTTGTAG GGGAGTTCAG TTTTTTCAGA CTCCAACAAA AATAACTGTA
ATTAATTCTA GCGGTAAAAA TTTTCATGTC TATCATCACC CTAGATTTGG TAAACGAATA
GTAAATAAAG TTAGTAAAAG TACAGAAGGA GAAGTCAGAA TTCAAGTACC TCAAACAGTA
CCATATTCTA TCAATACAAT TATTGATTTT ATTGATGGCT TTCATAAACC ATAA
 
Protein sequence
MIIEELQEKY AAGERNFTNL NLFEANLAGM NLSGVNLSGV NLSIANLSGT NLTNSNLSKA 
KLNITKLNGA HLSGTNLNGA DLNVANLVLV NLKRAQLIGA KLIRAELIRA QLSGANFSLA
DLSGASLVEA TLRKTNLSGA NLKGANLRFA FITESNLIEA NLEGADLSGA DLSGSDLSGA
ELRKSNLTGA NLNGANLSGA NLRWAVLTGA QMNCVNLSDA KLSGADLSRA NLFQANLLNA
SLVHVNLSSA CLIEVDWIGA DLTGANLTGA KIFGTSRLGI KTEAVTCDWL DLSPNGDQPQ
VYTLNTLKIK SFFHETLPKV KIIVDTRLDI SSHYALAATY YQISKYLPDI NQPPCIEVNS
CRTIISFSMN SDEKIFAVAY ITVLPFLDGY KTQENIIKIM KILNSHDFNS LTPIVAKIIR
QLSTVLNKSI QQVNKINKDK VLLKFCRGVQ FFQTPTKITV INSSGKNFHV YHHPRFGKRI
VNKVSKSTEG EVRIQVPQTV PYSINTIIDF IDGFHKP