Gene Acid345_1684 details


Gene Information

Locus tag: Acid345_1684
Symbol:
ID: 4069352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2040831
End bp: 2042552
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 58%
IMG OID: 637983692
Product: ASPIC/UnbV
Protein accession: YP_590759
Protein GI: 94968711
COG category:
COG ID:
TIGRFAM ID:
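
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2040831
end_bp = 2042552

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1722
print(protein_length)  # 573
```

Under these assumptions the results agree with the listed gene length (1722 bp) and protein length (573 aa).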


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.478922
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAGC GAATCGGGAA GATCCTGGCC GCGAGTATCT TGGTGTGCGC GGCCTTCGTT 
GCGACGGCCC AGGCGCAGAT CACCTTCAAG GACATCACCC AGCAGGCGGG AATCCACTTC
ACGCACACCA ATGGTGCGAC GGGGAAGAAG TATCTGCCGG AGACGATGGG GCCGGGCTGC
GCGTTCCTTG ACTACGACAA CGACGGATAT CCCGATGTGC TGCTGATTAA CGGGAAGACT
TGGGCGCCAG GCAGCAGCAG CACGATGAAG CTCTACCACA ACAACCACAA CGGGACGTTT
ACCGATGTGA CTGCGAAAGC AGGCTTGTCG GTACCGATGT TCGGGCTCGG GGTTGCGGTG
GGCGATTACG ACAACGATGG CTATGACGAT CTTTTCGTTA CTGCACTAGG ACAGAGCCAT
CTATTCCACA ACAACGGGAA CGGCACTTTC ACCGACGTGA CGAAGCAGGC AGGGATGCTG
GGGCCGAATG AGTTCAGCAC CAGCGCAGCG TGGGTGGACT ACGACAAGGA CGGCAAACTC
GATCTTGTCG TCGCGAATTA TGTGCAATGG ACGCCTGAGA CGGACATCTA TTGCACGCTG
GATGGCAGCA AGAAGTCGTA CTGCACGCCT GAGGCTTATA AGGGCGCATC GGTGCGGCTC
TGGCACAATC TTGGCGGCGG CAAATTCGAA GATGCAACCG CGAAGTCCGG GCTTTTTGAT
TCGACTTCGA AGTCGCTTGG CATCGCAGTG GCCGATGTGA ATGGCGACGG CTGGCCGGAC
CTCATCGTCG CTAACGACAC GCAGCCGAAC AAACTCTACG TCAATCAGAA GAACGGTAAA
TTCCAGGAGA GTGCGGTTTC TGCGGGCATT GCGTTCAGCG AAGATGGTGT TGCGCGCGCT
GGGATGGGAG TGGACGCCGC CGACTACGAT CGGTCTGGCA AACCCAGCAT CATCATTGGC
AACTTCTCGA ACCAGATGAT GGCGCTGTAT CACAACGAGG GCAACACGCT ATTCGTGGAT
GAAGCGCCAC GCTCGGAAGT CGGCCGCAAG AGTTTGCTCA CGCTTGGGTT TGCCTGCTTC
TTCTTTGACT ACGATCTCGA CGGTTGGCCC GATATCTTCG TGGCGAATGG ACACATTGAG
CCGGAGATCG AGAAGATCCA AAAGCGCGTG AAGTACTCGC AGCCTTCGCA CCTCTTTCAC
AACCAGGGGA ATGGACAGTT CACCGAAGTC ACGGGACAGG TGGGAACTGC ACTGGGGACC
CCGAAAGTCG CTCGCAGCGC GGCGTATGCC GATATCGACA ACGATGGAGA TCTCGATCTG
CTGATCACTA CGAACGGTGG TCCGGCGATG CTGCTTCGCA ATGACGGCGC GACGAACAAG
AGCCTGCGCA TTAAGCTCGA CGGGACACGG TCCAATCGCG ACGGCATCGG CGCAGTAGTG
ACCGTGCGCG CTGGGAACGA CAAGCAGGCG CAGATGCTGC GCAGCGGTTC CGGCTATCTT
TCCGCGAGTG AATTGGTGCT GACATTCGGA CTTGGCCAAC ACGCAACCGC TGATGAGGTT
CAGATTACCT GGCCCAGCGG TCAGGTCGAT CATCTCGCGG GTGTTGCTGC AGGACAGACT
GTCGTCGTGA AAGAGGGCGG GTTGGTTGAG CAGAAGCGGC CGTACGGCGC TTCTGCAGCA
GCTCCGAAGG CGGCACGGGC GAAGATCAAA GGTGGGAAGT GA
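
A GC content figure such as the one listed for this gene can be recomputed from a sequence printed in blocks of ten bases. The following is a minimal sketch; the function name `gc_content` is hypothetical, and it assumes the input contains only nucleotide letters and whitespace.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence above, used here only as a short example.
example = "ATGAATAAGC GAATCGGGAA GATCCTGGCC GCGAGTATCT TGGTGTGCGC GGCCTTCGTT"
print(f"{gc_content(example):.0%}")
```

Passing the full gene sequence instead of the single row gives the value for the whole gene.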
 
Protein sequence
MNKRIGKILA ASILVCAAFV ATAQAQITFK DITQQAGIHF THTNGATGKK YLPETMGPGC 
AFLDYDNDGY PDVLLINGKT WAPGSSSTMK LYHNNHNGTF TDVTAKAGLS VPMFGLGVAV
GDYDNDGYDD LFVTALGQSH LFHNNGNGTF TDVTKQAGML GPNEFSTSAA WVDYDKDGKL
DLVVANYVQW TPETDIYCTL DGSKKSYCTP EAYKGASVRL WHNLGGGKFE DATAKSGLFD
STSKSLGIAV ADVNGDGWPD LIVANDTQPN KLYVNQKNGK FQESAVSAGI AFSEDGVARA
GMGVDAADYD RSGKPSIIIG NFSNQMMALY HNEGNTLFVD EAPRSEVGRK SLLTLGFACF
FFDYDLDGWP DIFVANGHIE PEIEKIQKRV KYSQPSHLFH NQGNGQFTEV TGQVGTALGT
PKVARSAAYA DIDNDGDLDL LITTNGGPAM LLRNDGATNK SLRIKLDGTR SNRDGIGAVV
TVRAGNDKQA QMLRSGSGYL SASELVLTFG LGQHATADEV QITWPSGQVD HLAGVAAGQT
VVVKEGGLVE QKRPYGASAA APKAARAKIK GGK