Gene Acid345_3407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3407 
Symbol 
ID4072743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4028799 
End bp4030310 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content60% 
IMG OID637985429 
Producthypothetical protein 
Protein accessionYP_592482 
Protein GI94970434 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.391137 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACTCT TTCCTCTTGA CCTAACCGAA GCCCGCGCGG CGCTGGTGGC GCAAGCCAAG 
GCCAACACCG CCGAAGAAAA GAAAGCGGCC GAGGCGCTGG GCCCGCAATT GTTCCAGCTT
GGCGATCCGG AGAATGTCGC GTTCCAGCGG ATCACGTCGC CGAACACGCG TCGCGATCTG
AACCCGCTCA TGCACGAGCG GATGCAGCAG GTGTGCTTCT ACCTGTCGGT GACAACGCCG
ATGGGTAAGC GCATTGTCGA GATCATCACC AGCTACGTGG TGGGTGAAGG CTTCAACGTG
GTGGCGAAAG ACCCCGCGGT GCAGGAGATC ATCGATCGCT TCTGGAAAGA CCCGATTAAC
GATTTCGAGC GATCGCTGCG CGACTACTCC AACGACATCT CGGCGTTCGG CGAAATCTGT
TTACCGGTGG CGGTGAATCC CGTCGACGGG TTCGTTCGCA TGGGCTGGAT CGACCCGATC
GAGATCGACG CAATTGAGTA TCAGCCGATG ATGACGGCCT TTGGTGAAAC GACCATCACG
GTGCCGGCAT TCGTGCGGTT GAAGAAGAAG ATCGGCGAGC TGCAACCACA GCGCCTGCAG
ATCGTCCACA CCGATGAGGA TCCCAACTCG GAAACCTTCG GCCAGCTGGT GGGCGATTGC
TTCTTCTTCG CAGTCAACAA ATCGAAGGCC GCGAGCCGCG GATTGAGCGA CCTGTTCTGC
CTGGCGGACT GGCTCGATGT GCTGGACCAG ATGATCTTCG ACTTCGCGGA TCGCGCGCGC
TTCCTGAACA TGTGGGTTTG GGACGTGACG CTGAGCGGCG CCGACGACAA GCGGGTGAAG
GAGTTCAACC GCGACCTGAC GAAGGCCCCG CCGCGCCAGG GCGGCGTGCT CACGCACAAC
GAAGCGGTGA AGATCGAGGC CAAGACGCCG GACATGAAAG GCCAGGACTT CTCGGCCGGC
GCGCAGATGG TGAAGACCTA CGGACTAGGC GGCATCGGAC TTCCGCCGTT TATGTTCGCG
GATCCCACAG ACGTGAACCG CTCGACCGGC GAAGTGATGG AAGGCCCCAC AGGGAAAAAG
CTCACCGATC GCCAGAACGA AATCAAGCGG CTCATTCGCC AGGTCGTCCA GTTCGTGCTG
CACCAGGCGG TAGCGCATGG TGTGCTCAGC GCCAAGGCCG ATCTCGACTT CGACCTGCAG
GTGCCGGACC TGCTGATCAA AGATTTGCAG AAAGCCGGAA CCACGATGCA GGCGGTGACC
GGTTCGCTGG CAGCGGCAAA AGATGAAGGC TGGATCACGG AAGAGACGGC CGCGCGCTCG
TTCCACGTTG TGCTCTCGCA GATCGGCGTG CAAGTGGATT CACAGGATGA ATTCGCGGCG
GCGCAAAAAG AAAAACGGCA ACGCGATGCA ACCGCGCAAA ACTCGCTGAA TGACCAGAAG
AACCTGGCCG ATGCACTGAA CAACCTGGGC AACACGGATG CGAAGAAGGC TGCATCGGGA
GTTGTGCAAT GA
 
Protein sequence
MKLFPLDLTE ARAALVAQAK ANTAEEKKAA EALGPQLFQL GDPENVAFQR ITSPNTRRDL 
NPLMHERMQQ VCFYLSVTTP MGKRIVEIIT SYVVGEGFNV VAKDPAVQEI IDRFWKDPIN
DFERSLRDYS NDISAFGEIC LPVAVNPVDG FVRMGWIDPI EIDAIEYQPM MTAFGETTIT
VPAFVRLKKK IGELQPQRLQ IVHTDEDPNS ETFGQLVGDC FFFAVNKSKA ASRGLSDLFC
LADWLDVLDQ MIFDFADRAR FLNMWVWDVT LSGADDKRVK EFNRDLTKAP PRQGGVLTHN
EAVKIEAKTP DMKGQDFSAG AQMVKTYGLG GIGLPPFMFA DPTDVNRSTG EVMEGPTGKK
LTDRQNEIKR LIRQVVQFVL HQAVAHGVLS AKADLDFDLQ VPDLLIKDLQ KAGTTMQAVT
GSLAAAKDEG WITEETAARS FHVVLSQIGV QVDSQDEFAA AQKEKRQRDA TAQNSLNDQK
NLADALNNLG NTDAKKAASG VVQ