Gene Acid345_2739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2739 
Symbol 
ID4069430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3240500 
End bp3242254 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content60% 
IMG OID637984756 
Productaspartyl/asparaginyl beta-hydroxylase 
Protein accessionYP_591814 
Protein GI94969766 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCACGA CGCTGAAATT GCCGTTCCAG TTCGATGTGC ACCGTCTGAA GGCCGATCTC 
GCCCAGGTGC AGCCCGACGA ATGGGTACTG CACCACAATC GCGTGGACTA CGACGGCGGC
TGGAGCGGAG CCGCGCTGCG TTCGGCCGGC GGAGCCACCC GCAACCTTAT TCTGGACTCC
GGACAAACCG CGGATTACGC CGACACACCT CTGCTTGACC GCTGCCATTA TTTTCGCGAG
GTTCTGGCGG CGTTCGACTG CCAGTTGAAG AGCGTGCGCC TCCTGAGCCT CGCGCCGCAA
TCGTTTATCA AGGAACATAC CGACCGCGAG TTGCAGTATG AAGACGGTGA ACTCCGCATC
CATATTCCGA TCCAGACGAA TCCCCGCGTG GAGTTCTATT CCAACGGCGA GCGGCTCTTG
CTGGAAGAGG GAAGCACCTA TTACCTCAAT GCGAATCTGC CGCATCGCGT CAGCAACCGC
GGCGCGAGCG AGCGCGTTCA CCTCGTCATC GACGCCAAGG TCACACCGTG GGTTGACGCA
TTATTTCAGA GGTCGCTAGC GGAAAACTGG TCGGTGCCGC GGGTCGCGGC CGAAGAAGAA
TTCGAAAAAT TCCAGGCATT GCTGTTGCGC GAACCCGCCC TGCAAGCAAA GTTGCGCGCC
ATCTCCAACC CCGACGAATG TTTCACCACG GCTTGCCAGT TGGGACGCGA GTACGGTTTG
CAATTCGACA AGGCCGACAT CCACGCTGCG AAGCTTGCAC TCAACCGCGG CGAAGTGCCG
GGCACAATCG AGTTGGAGAG TTCGGCCGAG TCGCTGCGCG ATTGGGTCCC GGCGCGCGTC
CATGTCCGCG AGCCCGAGCC GATTGTGGAA TGGGCGCACT TTGGAAACCA GCCATTCACT
CAGCCCTTCT TCGAGCAGAC ACTCTCCGCC GCCACCGCCT CGCCGTTCGC GTCAGTCTTT
CGCGGCTATT CCACTCTGCC GACCGCCGCC GACTCCGCGC GCGACGTAGA ACCGCACGGA
TTCATCTTCC ATATGACGCG CTGCGGTTCC ACCCTGCTCA CGCAAATGCT GGCCACGCTC
CCGGAGAATC TTGTGGTTGC GGAAGCGCCC GCCATCGACC ACGTCGCGCA GGCTGACCTC
AAGCTCGCGT CACTCTCGAC GGAACAGCAC ATGCATTGGC TGCGGGCCAT CGTGCGCGCC
ATGGGACGCC CGCGAAACAG TAGCCAATCC AGGTTCTTCG TCAAACTGCA TGCCTGGAAT
ATCCATCAGT TACCGCTCTT CCGCGCGGCG TTCCCAGCGA CGCCTTGGAT CTTCTTATAT
CGTGATCCGC TGGAAGTCCT GGCATCGCAA CTGATGGAGC CCGGCAACTA CAGCGTACCC
GGCATGGTCG ATCCGCGTAC GTTGCAACTC ACCGCTGCCG ACGCAGTTCT CACCGACCGC
TTCGGCTGGT GCAGCAAAAT GCTGGCGCGC ATCTGCGAAT CCGCGGTCCG CTTCCACGAC
ACGAAAGCCC TGTTCGTGAA TTACACCGCG CTCCCCGAAG CCGCATGCAG CGAAGTAGCG
AAGCACTTCG GACTTCACCT GTCGGATGAC GAAGCAGAGA AGATGCGTGC CAGGGCGCAG
TTCCACTCCA AGACCCCGTT CTACTGGGCA GGGAAAGACG AACGTCCCAA AGTCCTCAAA
GACGAAGCAC AACGCCAGAA AGCCACAGAA CTCTTAGGTC CGATTTACCA GGAGTTAAAA
GCCCTGTCAT CCTGA
 
Protein sequence
MFTTLKLPFQ FDVHRLKADL AQVQPDEWVL HHNRVDYDGG WSGAALRSAG GATRNLILDS 
GQTADYADTP LLDRCHYFRE VLAAFDCQLK SVRLLSLAPQ SFIKEHTDRE LQYEDGELRI
HIPIQTNPRV EFYSNGERLL LEEGSTYYLN ANLPHRVSNR GASERVHLVI DAKVTPWVDA
LFQRSLAENW SVPRVAAEEE FEKFQALLLR EPALQAKLRA ISNPDECFTT ACQLGREYGL
QFDKADIHAA KLALNRGEVP GTIELESSAE SLRDWVPARV HVREPEPIVE WAHFGNQPFT
QPFFEQTLSA ATASPFASVF RGYSTLPTAA DSARDVEPHG FIFHMTRCGS TLLTQMLATL
PENLVVAEAP AIDHVAQADL KLASLSTEQH MHWLRAIVRA MGRPRNSSQS RFFVKLHAWN
IHQLPLFRAA FPATPWIFLY RDPLEVLASQ LMEPGNYSVP GMVDPRTLQL TAADAVLTDR
FGWCSKMLAR ICESAVRFHD TKALFVNYTA LPEAACSEVA KHFGLHLSDD EAEKMRARAQ
FHSKTPFYWA GKDERPKVLK DEAQRQKATE LLGPIYQELK ALSS