Gene Acid345_1371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1371 
Symbol 
ID4068847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1662393 
End bp1663805 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content59% 
IMG OID637983380 
Producthypothetical protein 
Protein accessionYP_590447 
Protein GI94968399 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.578419 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCCGC CGAAAATATC TTCAGGCCTG CGAGTCCACG GCCTTTACGA GTTGTTTCGG 
CGAGCCGGCG TCACCGCGCA CGATTGGGCC GAATGCGAAG TGACCACTGG ACCGGAATGT
CTCGCGGTGC GACTCGGTTG GGCGCGCCAG AAGCGCATCG AATTCCCAAT TTTCCCCGCC
AAAAGCGCAT GCGACAGGAC GGTCCGTCTC GGCTGGATGT TTCCACCCGA TGATCGCGTG
CACCGACTGG TTCCCGATTT CATATTGCCG CACCATCCCG AACCGGTCCC GCAAGGCTGC
CTCTTCGACC TCGTTTCCGA CCATGTAATC CGTTGCAAAA CCGACCTTCC CACCATTGCT
ATCGCCGTGC TCGCGCGCCT TGAGGAGACC GCGTCCAACG AGCGCGACCC GCATGGGCGA
TTTCCGGCAA ACGCCAGTGT CGCGTTTCGC GAAGGATTTC TCGGCCGCGC CATTGTCGAT
GAGTACGGTT TGGGTCTCCG CCAGGCGTTA GAAGTCCTCT CGCCAGAATG GAGCCCGCAG
CCCCGCGAAT TCGCGCTCAA GCTGTCACAC GACATCGACC GCATCGGTAT CCCGTTCCAC
TTTCGAACCG CTTTAAAGGA GACACTCCGA AGCCGTCGTC CGCAGGACAC CGCCTTGTCC
CTGTTGAGCG CTTTTGCGCT GGTGGATCCT CCCGCGCTTC GATCGGTGGA CCGCATCATC
CAGATCTCGA AAGAACAGAA TTTGAGTTCC GCAGTTTACT GGATGGCCAG CACGCCTTCC
GAGTACGATT CCGGCTATTC GATTTCCGAC CGCCGCGTCC GGCAGCATTT CTTGCATTTC
CTGGCCGAGC ACGTCGAGAT GGGATATCAC GCCAGTTACT TCGCCTTTGG CAACCCCGAG
TTGCTTCGAG CAGAATCCGA GAGTCTGCGA TCGGCGATTC AGACCGATGC GATCGGTGGC
CGCCATCACT ATTTAAGGTG GCTCCCATCG TGTTGGGGAG ATTGGGAAAA GGCTGGGCTG
AAGTACGACA GTTCGGTAGG TTATGCTGAC TGCATCGGCT TCCGTGCTGG TACTTGCATC
CCTTATCGGC CGTGGAATTT CGCCGAGGAC CGGCCCTACG ACCTTCTTGA AATTCCGCTG
CTTGTCATGG AGTGCTCGCT GATCGCAAAG CAGTACATGG GCTTGGGAAG AGGGGAGGCT
GTTGCGCGGG TGAGGGAACT TGCGGAGCGT TGCCGCCTTA CGGGCGGGGA GTTCACTCTG
CTCTGCCACA ACGACATGCT TTTCGACCCC GAGGTCGGCG AAGACTTCTA TCTCTCGCTC
GTGAAGGCGA TCGGCACCCG CGAAAAATAC GACTGGGCTG AAGACTTAAG GACGCAGCGG
CAGCAGTGCG CGCATCCCGC AATGGCGCAC TAA
 
Protein sequence
MFPPKISSGL RVHGLYELFR RAGVTAHDWA ECEVTTGPEC LAVRLGWARQ KRIEFPIFPA 
KSACDRTVRL GWMFPPDDRV HRLVPDFILP HHPEPVPQGC LFDLVSDHVI RCKTDLPTIA
IAVLARLEET ASNERDPHGR FPANASVAFR EGFLGRAIVD EYGLGLRQAL EVLSPEWSPQ
PREFALKLSH DIDRIGIPFH FRTALKETLR SRRPQDTALS LLSAFALVDP PALRSVDRII
QISKEQNLSS AVYWMASTPS EYDSGYSISD RRVRQHFLHF LAEHVEMGYH ASYFAFGNPE
LLRAESESLR SAIQTDAIGG RHHYLRWLPS CWGDWEKAGL KYDSSVGYAD CIGFRAGTCI
PYRPWNFAED RPYDLLEIPL LVMECSLIAK QYMGLGRGEA VARVRELAER CRLTGGEFTL
LCHNDMLFDP EVGEDFYLSL VKAIGTREKY DWAEDLRTQR QQCAHPAMAH