Gene Tery_3350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3350 
Symbol 
ID4243444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5136706 
End bp5139645 
Gene Length2940 bp 
Protein Length979 aa 
Translation table11 
GC content37% 
IMG OID638108334 
ProductGUN4-like 
Protein accessionYP_722925 
Protein GI113476864 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAATTC TTCACTTAGA TTTAAAATTA ATAGGGGATA ATTATGCTGA AGTCCGATAC 
TTTTGGAATA ATCCCAATGA TTACCAATCT TATCAATTAT CTTTGACAGA AACTTCTGAT
GTCATCAAAA AAATAGATAG TATTTATAAT ACAAGACTAC CAGAAGACTA CGCCAAAACT
GGTCAAACAC TTTATAACTG GTTGGATGGA AATAATCGTA TTTTTCGACA AGAACTTGAT
AAATATCAAC GGGAAGGAAT TGTTTTAGCT ATTTCTACTT CAGAAAGATT AGCTCATTTA
CCTTGGGAGT TGCTCCATGA TGGTCAAGAA TTTTTAGTTC AGCGTAAACC TGCTATTATT
CCTGTTCGTT GGGTGACCGA TAGTCAACAA CTTTCTTTTC AAAATCAACC TAATAACCGA
GCATTAAATG TGGTGTTTAT GGCAAGTTCC CCCCGCAACA CGGGAGACTA TTTTCGTGAA
CTTGATTTTG AAGCGGAGGA GGGAAGTATT TTGGAAGCAA CCAAAAGATC GCCTTTGTTT
TTAGAAGTAG AAGAAAGTGG TTGTTTGACT GAGTTAGGTT ATCTGGTGGA AAGCCAGGAA
GAGAATTTTT TCGATGTTGT TCATCTCACA GGTCATGCAA CTTTTGAGCA TGGCAAACCT
TGCTTTATTA CTGAGACAGA ATATGGGGAA GCAAAATATA GCAGTGCGGA AGATATTGCT
AAAGAATTAC AATTTCAGCA CCCGAAACTA ATTTTTTTAT CGGGTTGTCG GAGTGGATAT
TCACATCACG AAAATGTTCC ATCAATGGCA GAATCTTTAT TAAGTCAAGG TGCAACAGCA
GTTTTAGGAT GGGGCGGTTG GGTTTTAGAT ACAGAAGCAA CAAAAGCAGC AGCAAAGCTA
TATCAGGGGT TGTCCTTTGG TAAAGGGGTA ACGGAAGCAT TAGGGGAAAC TTACCAGGAA
TTAATTGAGG TTGAAGCTAG AGATTGGCAT AAATTACGAC TTTATGTAGC GGGTAGTTTA
CCGGGAGCGT TGGTAACCCC TTTGAGAAAA CGAGGTCGGA AACCAGCACC CCGTTACCAG
AAAACCATTG AATTTAGAGA CCCAGAAGGT AAACTTAGGG TGGCCACTCG TGAAAATTTT
GTTGGTCGTC GTCGTCAGTT ACAAAATTGT CTCCGTACTT TTAAGTTAGA TTCTGATAAG
TTGGGAGTGT TAATTCATGG TATGGGAGGT TTGGGAAAAA GTACCATTGC TTCTCGACTT
TGGGAGCGTT TATCGGAATA TGAAAGGGTT TCATGGTGGC GACAAATTGA TAATTCTAAT
TTAGTGGAAA AATTGGCAGA TAAATTGAAA GATGCTGACC TGCGTTTTGC CTTAAAAGAT
AATCAAAATG AACTCAAATA TAGGTTGCGA GATTTATTTG AAAATTTAAG TGAAGCTGGA
GCAAAACCTT TTCTTTTCCT GTTTGATGAC TTTGAATGGA ATCTTGAACC TCGTCAAGGT
GGATATGTTT TGAAGCCTAA AGCAGCAGAA GTTTTAAATG CTTTAATTTG GGCAATTCAG
GAAGTAGGTG CGAGGGAAAA AATTATTATT ACTTCTCGTT ATAAGTTTCA ATTTGAGTTA
TTAAGTGAGT TTTGGGTACA AGGTTTAGAA GCATTTCGCA AAGCTGATTT AGAGAAAAAT
TTGAAACGTC TGGAGAATTT TAATTCGGGA AAAATTGATA AACAATTAAT TGAAAGAGCG
TTAAAACTTG CCAACGGAAA TCCTCGTTTA TTGGAATGGT TGGATAAAGA TGTACTTGCT
TGTGGGGATA TTGACGGGAA GTTAAGTAAA CTTGAATCTA GTTCTGAAGA TTGGCAGGGG
AAAATTATTT GGCCTGAACT TTATGAACAA ATTGATGAAA AAATGATTCA GGTATTGAGT
CATTGTTTGG TGTTTGAAAT TCCTGTACCG ATGTCAGCTT TAGAGGTAGT TTGTGAGTCT
ATTTCTGGTT GTAAAGAACA ACTTAAAAGA GGGATTGATT TAAGTTTGAT AGAAGTTAGT
TCAGAAGTAG AAGAACCTGA GCGTGTTTAT CAGGTTTCAC GAGTGTTGCC TCCTATTATT
TCGAGTATTA AATTACCTAA AGCTCCTGAA GCTTATTCTT TCTATGAAAA AGGTTCAGAA
AAATTGTTTA AGTTGTGGGG TACAATACCC AATCAAAACT GGAAAAGATG GCGTGAGATT
TTCCGGTTAA AGTTTGCCCA TAAAGAAAAT CCTGAGCGAT TTCGACAGGG GTTGTCTTCA
ACGTTATCCT TCTATTATGA TAATCAGGGT GTGCACTATT ATAATTTAGC CCATGCTGCT
TATTGGTCTG AACTTAAGGA AATCAAAACT GAACTAGAAG AACTGCTACA GGATAATAAT
TTGGAAGAAT ATTTGCGTCA TAATAATTTG GAAGAATATT TGCGTCAAAG TGAATGGAGA
AAAGCTGATG AGGAGACACT AGTAATTTTT TATCAAACAA CAGTAATTTT TCCTGAATTA
ATAAAATTTC AAGGTTCTGA TGGTGTAACA TCTTTTATTT ATGATGATAG GGAGATTCCA
CGGTTAACAC TTGAGTACAT AGACCAACTT TGGGTAAAAT ATAGCCATGG TAAGTTTGGT
TTTTCCGTCC AAAAAAAAAT TTATCAGAGT TTGGGTGGAA AAGGAGAGTA TGACAGGGAA
GTATACGAAG CATTTGGTAA TGAAGTCGGA TGGCGTTCAG GAGAAAAACA GTTGTCTTAC
TCTGAACTAA CTTTTAGCTT AGATACGCAT TATACGGGGC ATCTCCCATT TCAGGCATGG
AATCATTGGG AAGAGGCTAG TAAATTCGCT GCGGCTATGA CAGGGAAGTA TCCAGTGAAG
TTTACAGTTT ACTACTATAC GAAGCATTTG ATTAGGTATC CGAGAAATAT GGAGATTTGA
 
Protein sequence
MQILHLDLKL IGDNYAEVRY FWNNPNDYQS YQLSLTETSD VIKKIDSIYN TRLPEDYAKT 
GQTLYNWLDG NNRIFRQELD KYQREGIVLA ISTSERLAHL PWELLHDGQE FLVQRKPAII
PVRWVTDSQQ LSFQNQPNNR ALNVVFMASS PRNTGDYFRE LDFEAEEGSI LEATKRSPLF
LEVEESGCLT ELGYLVESQE ENFFDVVHLT GHATFEHGKP CFITETEYGE AKYSSAEDIA
KELQFQHPKL IFLSGCRSGY SHHENVPSMA ESLLSQGATA VLGWGGWVLD TEATKAAAKL
YQGLSFGKGV TEALGETYQE LIEVEARDWH KLRLYVAGSL PGALVTPLRK RGRKPAPRYQ
KTIEFRDPEG KLRVATRENF VGRRRQLQNC LRTFKLDSDK LGVLIHGMGG LGKSTIASRL
WERLSEYERV SWWRQIDNSN LVEKLADKLK DADLRFALKD NQNELKYRLR DLFENLSEAG
AKPFLFLFDD FEWNLEPRQG GYVLKPKAAE VLNALIWAIQ EVGAREKIII TSRYKFQFEL
LSEFWVQGLE AFRKADLEKN LKRLENFNSG KIDKQLIERA LKLANGNPRL LEWLDKDVLA
CGDIDGKLSK LESSSEDWQG KIIWPELYEQ IDEKMIQVLS HCLVFEIPVP MSALEVVCES
ISGCKEQLKR GIDLSLIEVS SEVEEPERVY QVSRVLPPII SSIKLPKAPE AYSFYEKGSE
KLFKLWGTIP NQNWKRWREI FRLKFAHKEN PERFRQGLSS TLSFYYDNQG VHYYNLAHAA
YWSELKEIKT ELEELLQDNN LEEYLRHNNL EEYLRQSEWR KADEETLVIF YQTTVIFPEL
IKFQGSDGVT SFIYDDREIP RLTLEYIDQL WVKYSHGKFG FSVQKKIYQS LGGKGEYDRE
VYEAFGNEVG WRSGEKQLSY SELTFSLDTH YTGHLPFQAW NHWEEASKFA AAMTGKYPVK
FTVYYYTKHL IRYPRNMEI