Gene Tery_3342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3342 
SymboluvrC 
ID4243513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5124169 
End bp5126037 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content38% 
IMG OID638108327 
Productexcinuclease ABC subunit C 
Protein accessionYP_722918 
Protein GI113476857 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.991937 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAACTC AAATTTTACC TATAATAAAA GACTCTGAAC GTTTAGAAAA TAGACTCAAA 
GAGATTCCCC AAACTCCGGG AGTTTATCTC ATGCGTGATC GTAGCGATCG CATCCTTTAT
ATTGGTAAAT CTAAAAAACT TAGAAACCGA GTTCGTTCCT ATTTTCGAGA TGGAAAAAAT
CACACTCATC GTATCAGTTT AATGATACAA CAAATCGTTG AAATTGAATT TATTGTCACA
GATACAGAAG CTGAAGCTTT AGCATTAGAA GCTAATCTTG TTAAGCAACA TCAACCATAT
TTTAATGTCT TACTCAAGGA TGATAAAAAA TATCCCTATC TTTGTATAAC TTGGTCGGAA
GATTATCCGA GAATTTTTAT TACTAGAAAA CGCAGATTAG GTAAAGCTAA AGACCGTTAT
TATGGTCCTT ATGTGGATAC TAGATTACTC CGAAATACTC TACATTTAGT TAAACGAATT
TTTCCCCTAC GTCAAAGACC AAAACCTTTA TTTAAGGACC GCTCTTGTCT GAATTATGAT
ATTGGTCGTT GCCCTGGAGT TTGTCAGGAG TTAATTACTC CAGAGGAATA TCATAAAATT
GTGCAAAGAG TGGCAATGAT TTTTCAAGGT CGAACTGGAG AATTAATTGA TATTTTAAAT
ACTCAAATGG AAAAAGAAGC GGAAGCGCTA AATTTTGAAA AAGCAGCATT TATTCGTGAC
CAAATTAGAG GTTTAAATTC TTTAAATGCT GATCAAAAAG TTTCTTTACC AGATGATAGA
GTTTCTAGAG ATGCAATTGC TTTGGCGGGA AATCATCAAA TTGCTTGTGT TCAACTATTC
CAAATTCGGG CAGGAAAATT AGTAGGTAGG TTGGGATTTA TTGCTGAAAT TCCTGATTTA
GGAACTGCAG AAAATCAGGA GTTGGGAGTA ATTTTGCAGC GGGTTTTAGA ATCCCATTAT
CAAATAGTAG AATCTGTAGA AATTCCTACG GAAATTATTG TGCAAAATGA GTTGCCGGAG
AGTAATTTTT TGCAAAATTG GTTAACAGAA AAGAAGGGTA AAAAAGTAGA AATTTTTGTA
CCTCAACGCC AGGGGAAAGC TGAATTAATT GAGATGGTAA AAAAGAATGC TGAGTATGAG
TTATTGCGGT TGGCAAAAAT GAGCGATCGC AATAATGAAG CGATGACAGA TTTAGCAGAA
ATCTTAGATT TACCAGAGTT ACCTCACCGG ATAGAGGGTT ATGATATTTC CCACATTCAA
GGTTCGGATG CGGTGGCATC ACGGGTAGTG TTTATTGACG GTTTGCCAGC AAAACAACAT
TATCGACATT ATAAAATTAA AAACCCAGAA GTGCGTTCCG GTCATTCTGA TGATTTTGCC
AGTATGGCTG AGGTTATTGG GCGACGGTTT CGTGACTATG AAAAACAAAC TACTACAGAC
AAACCAGATT TAATTATGAT AGATGGTGGG AAAGGGCAAC TGTCAGCAGT AGTGGCAGTG
ATGGAAGAGA TGAATATATT AGAAGAAGTG CGGGTGGTGA GTTTGGCAAA ACAACGGGAG
GAGATTTTTT TGCCTGGGGA GTCTGAACCT TTAAGAACTG ATGCGGAACA ACCGGGGGTG
CAGTTGTTGC GGAGGCTGCG GGATGAGGCT CATAGATTTG CTGTGAGTTT TCATCGGGAT
AGAAGAAGTC AGAGGATGAA GCGATCGCGT TTAGATGAGA TTCCTGGTTT AGGGCAACAT
CGACAAAAGT TGTTATTGGG GCATTTTCAT TCTGTTGATT ATATTAGGAT GGCAACGGTG
GAACAGTTGG CAGAAGTGTC GGGAGTCGGA CCTAAGTTGG CACAACAGAT TTATGATTAT
TTTCATTAA
 
Protein sequence
MITQILPIIK DSERLENRLK EIPQTPGVYL MRDRSDRILY IGKSKKLRNR VRSYFRDGKN 
HTHRISLMIQ QIVEIEFIVT DTEAEALALE ANLVKQHQPY FNVLLKDDKK YPYLCITWSE
DYPRIFITRK RRLGKAKDRY YGPYVDTRLL RNTLHLVKRI FPLRQRPKPL FKDRSCLNYD
IGRCPGVCQE LITPEEYHKI VQRVAMIFQG RTGELIDILN TQMEKEAEAL NFEKAAFIRD
QIRGLNSLNA DQKVSLPDDR VSRDAIALAG NHQIACVQLF QIRAGKLVGR LGFIAEIPDL
GTAENQELGV ILQRVLESHY QIVESVEIPT EIIVQNELPE SNFLQNWLTE KKGKKVEIFV
PQRQGKAELI EMVKKNAEYE LLRLAKMSDR NNEAMTDLAE ILDLPELPHR IEGYDISHIQ
GSDAVASRVV FIDGLPAKQH YRHYKIKNPE VRSGHSDDFA SMAEVIGRRF RDYEKQTTTD
KPDLIMIDGG KGQLSAVVAV MEEMNILEEV RVVSLAKQRE EIFLPGESEP LRTDAEQPGV
QLLRRLRDEA HRFAVSFHRD RRSQRMKRSR LDEIPGLGQH RQKLLLGHFH SVDYIRMATV
EQLAEVSGVG PKLAQQIYDY FH