Gene Acid345_3360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3360 
Symbol 
ID4071278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3986391 
End bp3987770 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content59% 
IMG OID637985382 
Producthypothetical protein 
Protein accessionYP_592435 
Protein GI94970387 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCAGC GTCGGAAGGG TTTGTTGATC GCGCTCTTGG CAGTTGTTTG CAGCTTCGTG 
CTGTTGCTGC CTGCCTTGGC CGAATCCAAT GTCCGCATCG TTCGCCTGAG CTACATTGAT
GGCGACGCCC AAATCAACAC GACAAACCAG GATGACGGAT TCACTCATGC CGTTCTCAAT
ACGCCGGTAA CAGCGGGCAT GTGGATCTAC ACGCCGAACA ACTCGCACGC TGAAATCCAG
TTTGAAAATG GCAGCACCGT GCGAATGGTG GATGACGCGC AAATCCAGTT CGAAAAACTC
GCGCTTGCCG ATTCCGGCGG AAAGATCAAC ATCATTAATG TTGATCACGG CGTGGTGTAC
TTCAACTTCT CGAAGGTAGG CAAAGACGAC AACATCATCG TCAAGGCCGG CGCCAAGACC
ATCCACGTTG CAAAGTCCTC GCACTTCCGC GTCGACGCCA GCGACAAGAA TGTTCTCGTC
TCCGTGTTCA AGGGCGACGC CATGGTTGAC GGCGACCAGT CGATCGAGAT TAAGAACAAC
GAATCCGTGA ACCTTGCCGC CGAAGACGCG AAGGTTGGTA GAGGCGTCGA CGAGCTGGGC
AGCGACACGT GGGACAAGCA TCGTGACGGA GAGGTTGCGG CGCTGAGCAT GAAGGCCGCC
CCGGTTGGTT ATGGCGATGC GTATAGCTCA CAGTTCGGAT ACCTTGGTTC TTACGGCAAT
TACACCAACG TCCCGGGCTT CGGTTGGGGA TGGCAGCCCT ACGGCATGGG AATGGGTTGG
GACCCGTTCA TGAACGGCGT CTGGAACTAC AACCCAGGCT TAGGCTACAT GTGGGTTTCG
TCGTATCCGT GGGGATGGGG ACCGTATCGC TACGGTGCCT GGAACTACGT TCCGGCCTAC
GGCTGGATGT GGATGCCCGG CTCGAGTTTC AATTCCTGGA ACGTGGGTCC AGCATACGGA
GCTGTGCCGG CTAACTGGCA CGCACCAACC GTTCCGGTTG TCGGCAAGAC TCCGGTGAAG
ACGGTTGTGG TGGGCAATCC ACCGAACGTG CACCCGGCAA TTCTCGCGGG ACATCCTGAA
GGTGGATCGC ATGCCGCAGT CTCGACGCGC GCGAAAGCTT CGAACAATGT CCGCGTGAAG
CCGCCTGTGG CGACTGCGAC GTCCGGCGCC AAGCCGACGT CGAATACAAC CGCCACGAAG
ACTGGCACGA GCACTGGCGC GAAGAGCGGT GCGCAGCCTG CGCATGCCGG CGGCGCACAG
CACGCGAGCG GGGGACAACC CTCTGGTCAA CACATGGGCG GACCGCCAAC GGGTGGTGGT
CAACGCATGG GTGGCGGAGC ACCGGCCGGA GGCCATCCGC CTGCGACTCG TCCTCACTAA
 
Protein sequence
MLQRRKGLLI ALLAVVCSFV LLLPALAESN VRIVRLSYID GDAQINTTNQ DDGFTHAVLN 
TPVTAGMWIY TPNNSHAEIQ FENGSTVRMV DDAQIQFEKL ALADSGGKIN IINVDHGVVY
FNFSKVGKDD NIIVKAGAKT IHVAKSSHFR VDASDKNVLV SVFKGDAMVD GDQSIEIKNN
ESVNLAAEDA KVGRGVDELG SDTWDKHRDG EVAALSMKAA PVGYGDAYSS QFGYLGSYGN
YTNVPGFGWG WQPYGMGMGW DPFMNGVWNY NPGLGYMWVS SYPWGWGPYR YGAWNYVPAY
GWMWMPGSSF NSWNVGPAYG AVPANWHAPT VPVVGKTPVK TVVVGNPPNV HPAILAGHPE
GGSHAAVSTR AKASNNVRVK PPVATATSGA KPTSNTTATK TGTSTGAKSG AQPAHAGGAQ
HASGGQPSGQ HMGGPPTGGG QRMGGGAPAG GHPPATRPH