Gene Tery_1569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1569 
Symbol 
ID4242127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2397804 
End bp2398958 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content43% 
IMG OID638106712 
Producthypothetical protein 
Protein accessionYP_721322 
Protein GI113475261 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.339258 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTT ACGACAAAAT ATCTGACTGG TTAGAAATTA ATTGGGTTAA ACCCGCCTAT 
GGGGGTGGGT TACTCGGAGT ATTGTCAATT TTCTTCTTTG CTGCTGCTGC TAATACTATG
GCAGGATGGT TATACTTAAT TAGTGCCATT ACTTTCGCAC TGTTGGGAGT AAGTGCTCTT
TTGCCTGGGC GATCGCTACG TGAAATAAAA GTACGTCGTG ATCGAATTCA ACCTATAACT
GTGGGTGATA GTTTAGCGAT CGCTTTAACA ATAGAAAATA CTACTAACAA ACCACTGGCT
TTGCTACAAG TCCAAGATGA AATTCCTTTT GTATTAGGGA AACCAGTGCA ACAAGCAATA
GAGGTTATTC CCCCTAAAGA AAGTTACCAT TGGGTTTACT ACCTTCCTAC TAAAAAAAGG
GGTATATATA GATGGCATTA TCTTCAGCTT AGAACTGCAG CACCTCTAGG TCTCTTCTGG
TGTCGGCGTA GTCGAAATGC TAAAGCTACT GCTATTGTCT ATCCCACGGT TTTACCTTTG
AGTCGTTGCC CGATCTTGGA TGAGCTTGGT GAAGAATATA GTCGCCAACT TCATGAACAT
AGTCGTTTTG AAATGGCTTC CCAGGGACTA ACTCGGACAT TAAGACCGTA TCGGTTTGGA
GACTCTAGTC GTTTGATCCA CTGGCGGAGT AGTGCACGTT ATGGGGAGTT GCGGGTTCGA
GAGCTAGAGG TATCTAAGGC TGGTGAAGAA GTTCTAATTT GTCTTGATAG TGCTGCTGAG
TGGCAACCTG ATAATTTTGA GGCTGCGGTG ACTGCTGCTG CTTCTTTATA TTTTTATGCT
AACCGCTCTT TGTTAAATGT TCGACTTTGG ACAGCTGCTA CTGGGTTGGT TTATGGTAAT
TTGGCTATGT TGCAAACTTT GGCTGGGGTT AGGTTTGGGG AAGAGGTGGT GGTGGGAAAC
CCACCGGAAA AGCCTTTGGT TTGGTTGACT CAAAATTCTC TGAGTCTGAG TTCTCTTCCT
CCTGGGAGTA GATGGTTGTT GTGGTTAGAT GAGTCTGTTG GTTATGCAGC AGAAACACCA
AGGATGCAAT ATTGTTCTGG TTTGAATATT AGTTCTGATC AACCTTTGGA GTTTCAACTT
CAGTCTAGTG TTTGA
 
Protein sequence
MKIYDKISDW LEINWVKPAY GGGLLGVLSI FFFAAAANTM AGWLYLISAI TFALLGVSAL 
LPGRSLREIK VRRDRIQPIT VGDSLAIALT IENTTNKPLA LLQVQDEIPF VLGKPVQQAI
EVIPPKESYH WVYYLPTKKR GIYRWHYLQL RTAAPLGLFW CRRSRNAKAT AIVYPTVLPL
SRCPILDELG EEYSRQLHEH SRFEMASQGL TRTLRPYRFG DSSRLIHWRS SARYGELRVR
ELEVSKAGEE VLICLDSAAE WQPDNFEAAV TAAASLYFYA NRSLLNVRLW TAATGLVYGN
LAMLQTLAGV RFGEEVVVGN PPEKPLVWLT QNSLSLSSLP PGSRWLLWLD ESVGYAAETP
RMQYCSGLNI SSDQPLEFQL QSSV