Gene Tery_4936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4936 
Symbol 
ID4246590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7515207 
End bp7517489 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content36% 
IMG OID638109748 
Productprotein-arginine deiminase 
Protein accessionYP_724324 
Protein GI113478263 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.370657 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTGGA TACAATACAT AAAAATAGCT CTAGTAAATA CTGTTGGGGC ATTAGTAGGA 
ATTATCTGGG TACCTGTACC TCAAGCAGTG GCACAATCTT CACTTCCTGA GCCAGCTAAC
TCAGAAATTT CTGCCTTAGA AACTATCAAG TCGATAAACA AGAACTTAAA AATTCCTAAA
ACAATAAGTT CACTGACAGA ACTGTACCAG ATAAAAGATC AATTAAAAGT AGAACTAGAG
CGAGTCTCTA AAATGCCAAA CTTCAAGAAA GTTAGGGAAC CTTGGCAATA CCAATTCCAG
TTACAACAGT ATGAAAAAAC TTTGAAAGAC TTCCGCAGTA TAGAAGCAAA AATTATTAAA
GAAGAAAAAG CTCTTCAAAG TTGGAAACAA GCACTGAGCA TAGCAACCAA TGCTGTAGCA
AAAGGAAAAA AACTACAAGC AAATTATCAA ACCTGGGAAG AAGCAGAAAA TCTTTGGTTA
GAGGCAATTT ATAGTTTACG TCAAATTCCT CAAGATTCAT TGATGACAGA TAAAGCGATC
GAAAAAATGC TTGAATATCA GGGGTATCTA GCAGTGGCCT GTTATGAAAA AGTAATGGCA
GTAAGAGTAT GGGAAAAAGA GAAAGAAAAG AAGACAAATA GCCATACCAA AACTTATCGA
CCCATTGCAT ATAGTTTATC TCCAGGTTTT ACCATGTATG GTGATACTAA CAGAGATGGG
AAAGTGGATG AAACTGACAA GTCAGGTAGA GAAAACTGGT CTTTATCAGA AGGTGCATTA
ATGTTATTTA ATAATGATGA TGATAACGGC GATCTAATCC CAGATTGGCG AGATGGGGAT
GTTAATGGTG AAAATGATGT AGCAGATTTA GCTATTGTCA ATATTCGATT AGCAGAAAAC
TATAGAGATG CTCAAATCTA CATCTCTACT GACGCAGATG TTAATAACTA TATCAATATC
TTCCAAAAAA CAGAATCTGG ATGGCAACCA GTAGATCTTT CTGGCACTGA AGCATTAATA
TCTAGAGCAA AGATAGTATT AGGTGTAGAA GCTAAACAAT TTGCTGACAG GGACTGGAAA
GGAGTTGTTA ACTTGACAGC GATAGCTAAA AAAAATGGTA GACAAATTGC TTCTGATAGT
ATTCAAATAG GTGTAGTTCC TTGGTTAATG TCTCCTAATA CTGCTCCAGT CAAAGAACTG
CACGTTAGTG AGCGAGGTTT AGCTAATCAA AAGTTTATCA GTAAAATCAC TGAAATTATT
GAAAAAACAG GTGTGACAGC TAAAATTAAT CCTGGCGGAA CAACCTGGAT GCAAGATACT
AAAAAAATAG GTTATGTGCA GTTTCCCACT CAAGGTAAAA TGCACAATAT GAATGTGGTA
CTTAAAGGTA ACCGTCTACA GGAAAATGAT CAGTATGCTA GAAGCCTCCT CAAAGAAAAT
TTTGGTTGGT TTGAAGTTGG TAAACCCAGA CAACTAGATC CTCTCAACAG GTGGGCAGAT
GCTTATGGAA ATCTAGAAGT AACACCACCT TTACCAGGAT ATCCTATGGG AAGAGTTTAT
TATGGGAAAG CAGGTGAAGT GGGTATGAAT CCTGATATTA TTGATTTCAT CAAAGCACAG
AAAATTCAGG GTCCCCCAGT AGATATTGAT ACTTCTTGGT TAATGATGCG TCATGTAGAT
GAAATTATTA GTTTTATTCC TAGTAAAACT GGCAAACCTC TAATGTTAAT TGTGAGTCCA
GAAGCAGGGG TCAAATTATT AGAAGAACTC AGTGAAGAAA ACTATGGTAA AGCTGCTATA
AATCGTGGTT TAAGCACTCA AATAACAGTG CGGGCTGCGT TGAAAAATGA AAAGTTAGTT
CAGCATAATC TCTATTTACA ACGGGAAAAA TTAAACCCAT TAATTGAGAA GTTAAAACAG
GAATTTAATC TGAGCAATGA CCAGATAATT CAAGTACCAG CTATGTTTGG ATATAGCGGT
TATGCTTGGT GGCCGAATAT GGTTAATTCA GTGGTAATTA ATGGAGAATT ATTGGTTTCT
AGTCCCGGAG GAGCATTAAT TAATGGTCGA GATTATACTC AAGAGAAATT TCGCAGGTTG
GTGTCGAATT CAAGCTTAAA TATTAATTTT ATGGATGATC AATATTATCA ACAACTAAGA
GGAAATGTAC ATGATGCTGT GAATACAACT CGCTTAGGAA AAAATCATCC TTTCTGGAAA
TCCTTATCTG AAAATATATT AGGGTTTAGA GGGCAAAGTT TAGATATGGT AGATACGAAA
TAA
 
Protein sequence
MGWIQYIKIA LVNTVGALVG IIWVPVPQAV AQSSLPEPAN SEISALETIK SINKNLKIPK 
TISSLTELYQ IKDQLKVELE RVSKMPNFKK VREPWQYQFQ LQQYEKTLKD FRSIEAKIIK
EEKALQSWKQ ALSIATNAVA KGKKLQANYQ TWEEAENLWL EAIYSLRQIP QDSLMTDKAI
EKMLEYQGYL AVACYEKVMA VRVWEKEKEK KTNSHTKTYR PIAYSLSPGF TMYGDTNRDG
KVDETDKSGR ENWSLSEGAL MLFNNDDDNG DLIPDWRDGD VNGENDVADL AIVNIRLAEN
YRDAQIYIST DADVNNYINI FQKTESGWQP VDLSGTEALI SRAKIVLGVE AKQFADRDWK
GVVNLTAIAK KNGRQIASDS IQIGVVPWLM SPNTAPVKEL HVSERGLANQ KFISKITEII
EKTGVTAKIN PGGTTWMQDT KKIGYVQFPT QGKMHNMNVV LKGNRLQEND QYARSLLKEN
FGWFEVGKPR QLDPLNRWAD AYGNLEVTPP LPGYPMGRVY YGKAGEVGMN PDIIDFIKAQ
KIQGPPVDID TSWLMMRHVD EIISFIPSKT GKPLMLIVSP EAGVKLLEEL SEENYGKAAI
NRGLSTQITV RAALKNEKLV QHNLYLQREK LNPLIEKLKQ EFNLSNDQII QVPAMFGYSG
YAWWPNMVNS VVINGELLVS SPGGALINGR DYTQEKFRRL VSNSSLNINF MDDQYYQQLR
GNVHDAVNTT RLGKNHPFWK SLSENILGFR GQSLDMVDTK