Gene Acid345_3793 details


Gene Information

Locus tag: Acid345_3793
Symbol:
ID: 4071077
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4479648
End bp: 4481531
Gene Length: 1884 bp
Protein Length: 627 aa
Translation table: 11
GC content: 56%
IMG OID: 637985816
Product: heparinase II/III-like
Protein accession: YP_592867
Protein GI: 94970819
COG category:
COG ID:
TIGRFAM ID:
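
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers are consistent with it) and that the final codon is a stop codon that is not counted in the protein length. All variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 4479648
end_bp = 4481531
gene_length_bp = 1884
protein_length_aa = 627

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is the stop
# codon and does not contribute a residue to the protein.
expected_aa = span_bp // 3 - 1

print(span_bp, span_bp % 3, expected_aa)  # 1884 0 627
```

Under these assumptions the span matches the stated gene length, and the codon count minus the stop codon matches the stated protein length.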


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACTTGGT ATTTCGCACG TATGCGCCAG ATGGGGCCGG CGGAGGTGCT TTATCGCACT 
CGCCAGGCCG CCAACGTACC CATTGATTAT TTGGGTAAAG GCCAACGCGC GCAATTTCAG
GCTCCCCACG GGGAATGGTG GTCGCCGAGG AAATACCCTG TCACGTTCAA CCGGGCCGGC
GCCTCGTTCG AAAAGATACG AATCTTCGAT CTCGAGTTCC CTGCGGATTT CGAGTTCGAC
TGGCATCGTG ATTACCGCAA CTCCAAGTCG GTAGAGCGCA AATTCTCGCG CAGCCTGGAT
ATCCGCGATC CCGAAGTAGT TGGCGACATT AAGTACATTT GGGAGATCAA CCGTCACCAG
CATCTGAGTG CGCTCGCATA CTCGTCCCGT CCGGACGCGT CGGACATCGT TGCGCGCTCG
CTATCGGACT GGCTCGATAG CAATCCGTAT ATGGAGGGTG TGAACTGGAC GAGCAGCCTG
GAATTCGGTT TACGGCTCAT CTCCTGGGCT GCCATTTTTC CGACAATGCG AGAACACTTC
GCACGCAATA AAGCACTCCG CGAGAAGTTG GCGACTTCAG CCTACTTGCA TATGAAGGCG
ATTCGGCGGC ATTTGAGCCG TTACTCTTCC GCGAACAACC ACCTGATTGG GGAACTCGCT
GGATTGTATG TGGGTGCGAC CTGCTTCCCC TGGTGGAATG AATGTGACAC CTGGCGAGAC
TTCGCTCGGA GAGAACTGGA ACGTGAGATT CTCGCGCAGT TCACTCCGGA AGGTGTAAAT
CGCGAGCAAG CGATGTCGTA CCAGTTCTTC ACGCTCGAAA TGTTGCTCTT TGCCGGATTG
GTCGCACGTA ACTCTGGCGA CGCCTTTGAA GGGGCGTACT ACGAGCGCCT CCGCAAGGGT
TTAGATTATG TTCTTCTCGT CGCGACGAAG AGTGGCGACC TGCCATGGTT TGGTGACTCG
GATGATGCCC GCGGATTCTT GTTTTCTCCG AATGAATCAG AATTGCAAGC AGTCATGAGT
TTGGGGGCTG GGCTTTTCGA TGATGAACGA TACCTGTCCT TTGCGCCCCG CGGGACGGTT
GCAAGCAAGG CATTGCTCGG CCCAGCAACA GAACCGATCG TCCGCCGCGC GAGTACGCCG
CAAGTCGGAA CTGCGGAACT CCTGCGTGAA GGCGGTATCG CCGTAATACA GGCCGATGAC
TGGAAAGTCG TAATGGACGT GGGTCCGCTG GGATTCACAA CCATCGCGGC GCACGGTCAT
GCAGATGCCC TTTCGTTATT GCTGGCAGTG AAAGACCGGT ACGTTCTTGT CGACCCCGGT
ACCTATGCGT ACCATTCCCA TCCGGAGTGG CGTGCGTATT TTCGAGGCAC TGCAGCGCAC
AACACCGCGC GAGTCGATGG CCAAGATCAA TCTGTCATGC GCGGAAGATT CCTGTGGGAC
CAGAAGGCGA ACGTGAAGGT AAAGCACTTC GCCGAAGAGG CCACCGAGAT CTGCATTGCA
GCGGAACACG ATGGTTACAC CCGGCTCTCT GATCCGGTGG TACATCGTCG CGCAGTAACG
GTGAAGAAGC AGGAACGGAT GATCGAAGTT GAAGACGCCT TCGAATGTAA GGGCGATCAC
TCCATCGAAC TATATTGGCA CCTCGTTGAA TCGCTGACGC CGGAGGCGTT GCCGGGCGGC
AGTATTCGCG CCGAGGGCGA TGGGGTGAAA TTGGACTTCA CTTTCGAAGG ACAGCAAGGC
GACATTGCGA TCATCCACGG TTCGGAATCG CCTATTCTGG GATGGCGATC GACGGAATTT
AACGTAAAAC ATCCGACTTC CACGGTGCGC CGCTCGCTGC GTATTCGCGG AACGCAATCC
ATTAAGACAC GAATTCAGTT TTGA
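
The GC content reported for this gene is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name `gc_percent` is hypothetical, and the whitespace handling assumes the space-separated blocks of ten bases used in the listing above. To reproduce the figure for the whole gene, pass the complete sequence.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces, newlines) between blocks is ignored.
    """
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First block of ten bases from the gene sequence above.
print(gc_percent("TTGACTTGGT"))  # 40.0
```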
 
Protein sequence
MTWYFARMRQ MGPAEVLYRT RQAANVPIDY LGKGQRAQFQ APHGEWWSPR KYPVTFNRAG 
ASFEKIRIFD LEFPADFEFD WHRDYRNSKS VERKFSRSLD IRDPEVVGDI KYIWEINRHQ
HLSALAYSSR PDASDIVARS LSDWLDSNPY MEGVNWTSSL EFGLRLISWA AIFPTMREHF
ARNKALREKL ATSAYLHMKA IRRHLSRYSS ANNHLIGELA GLYVGATCFP WWNECDTWRD
FARRELEREI LAQFTPEGVN REQAMSYQFF TLEMLLFAGL VARNSGDAFE GAYYERLRKG
LDYVLLVATK SGDLPWFGDS DDARGFLFSP NESELQAVMS LGAGLFDDER YLSFAPRGTV
ASKALLGPAT EPIVRRASTP QVGTAELLRE GGIAVIQADD WKVVMDVGPL GFTTIAAHGH
ADALSLLLAV KDRYVLVDPG TYAYHSHPEW RAYFRGTAAH NTARVDGQDQ SVMRGRFLWD
QKANVKVKHF AEEATEICIA AEHDGYTRLS DPVVHRRAVT VKKQERMIEV EDAFECKGDH
SIELYWHLVE SLTPEALPGG SIRAEGDGVK LDFTFEGQQG DIAIIHGSES PILGWRSTEF
NVKHPTSTVR RSLRIRGTQS IKTRIQF
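
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. Note that the gene starts with TTG rather than ATG; in bacterial genes an alternative start codon such as TTG is still read as methionine at the initiation position. The sketch below is an illustration under stated assumptions, not the tool that produced this record: the names `CODON_TABLE` and `translate` are hypothetical, the codon-to-amino-acid mapping is the standard one (which table 11 shares for elongation), and the `is_cds_start` flag simply forces the first residue to M without checking that the codon is a valid start.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str, is_cds_start: bool = False) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    If is_cds_start is True, the first codon is treated as the
    initiator and reported as M (a simplification).
    """
    bases = "".join(dna.split()).upper()
    usable = len(bases) - len(bases) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        residue = CODON_TABLE[bases[index:index + 3]]
        if index == 0 and is_cds_start:
            residue = "M"
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 24 bases of the gene sequence listed above.
first_30 = "TTGACTTGGT ATTTCGCACG TATGCGCCAG"
last_24 = "ATTAAGACAC GAATTCAGTT TTGA"

print(translate(first_30, is_cds_start=True))  # MTWYFARMRQ
print(translate(last_24))                      # IKTRIQF
```

The two fragments reproduce the first ten and last seven residues of the protein sequence, and the final codon TGA is the stop codon.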