Gene Acid345_2624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2624 
Symbol 
ID4072033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3094736 
End bp3096367 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content61% 
IMG OID637984641 
Producthypothetical protein 
Protein accessionYP_591699 
Protein GI94969651 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.672502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.615822 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGA CCAAGCCCGC CCCGCCGCCG CTACCGATTG ACTACTCGGG ATTCGTGGCG 
CTGTGGGGCG CGTTATTTTC TTTCGCTGCA TTCCTGTACT TCTACCGCCA CGGCGAGACG
CTGTTGTACG GCGACGCCGT TGCGCATATC AACATCGCGC GACGGATCTT CGATTGTCGC
GAGCCGGGAT TGCGCCAGCT CGGAACCGTG TGGCTGCCGT TTCCGCACCT CGTGATGGCG
CCGTTTTTGC TCAATGACAA CTTCTGGGTA AGCGGCATCG GCGGCTCGCT GCCTTCGATG
GTGGCGTTCG TGCTGGGTGC AGTCGGGCTC TACCGGCTCG TAGCGGCGCG AACCGCGCAC
TGGGTCGGCG GAGTGGCGGC GGGGATCTAT TTGCTCAATC CCAACCTGCT GTACATGCAG
TCGACCGCGA TGGGCGAGAG CATCTACCTC GCGCTGATGA TCTGGGCGGT GTTTTACCTC
GACGCATTCG CGCGCGGTTT GCGTGACCCG GAGCAACCGC TTCGACCTGC AAAAGCATTG
ACGCGTTGCG CAATGGTACT CGCGGCAGCG ATCCTCACTC GCTATGACGG TTGGTTCTTC
ACGTTCATCA TTGCGCTGGC CGCGCTGTTT ATTCTGGTGC GCAACTGGAG CTTGCAGAGC
GACAAACAAA AGCGTTTGCT GACGCGCTCC GCCATTCACT TCACGCTGCT GTGCACACTC
ACCCCGGCTT TGTGGATGAG CTACAACTAC TGGCTCTCAC GACATCCGCT GGACTTCGCA
ACCGGGCCAT ATTCCGCGAA GGCGATTGCC GCGCGCACCA CGCCGCAAGG CGCGCCGCCC
TATCCCGGCA AAGACCACAT GGCTACCGCG GCGACGTATT TCCTAAAGGC CGCGAAAGCC
AACATGGCGG AAGGCCGCTG GCAGTTCTGG CTGATGGTGG CTGCGGTGCT CGGGAGCGCA
ATTGCGGCGG TCGTGGTTCG CGGCGGCTGG CTTTGGCTTT TGCTATGGAC GCCGTTGCCG
TTTTACGCGC TTTCCATCGC CTACGGCAGT GTGCCTATCT TCGTGCCGGA GTGGTGGCCG
TTCTCCTACT ACAACGTGCG TTACGGGATG GAACTGATTC CCGTCTTCTG CGTGAGTGTG
GCGTTCCTCG CGTCGCTAGG GAAGCGCGCG ATGCTGCCGG GACGCTGGCA GATCGCACTG
CCGGTTGTCG TGCTGGCGAT TGTGGTAGGA GGCTATTACG CGTCGTGGCG CGCGACCCCG
ATTTGTCTGC GCGAGGCCCA GGCCAATGGC CGCAACCGCA TGTCGGAGGA CGCTGCCGTT
GCTCGCTACA TCCAGATGAT GCCGCCGGAT ACGACCATCC TGATGCAGAC CGGTTCCTAC
GTCGGCGCGT TACAGATGGC GGGACGGCAC CTCGACAGCG TGGTTTGGGA AGGGCTCTAT
TACCAGTGGG AACTAGCCCT CAATCAACCG GCAGAAAAAG CGGACTACAT CATCGCCTTC
GGTAACGATG AGGTCGCGCA AGCCGTAAAG GCGCATCCGC AAGGCCTGGA AAGCATCGTG
GTGTTGCGTG TTGGCGATCA GGCCCCGGCC ACGATTTATC GCAGCACCGC TCGAAACGCA
CGGCCGCTTT AG
 
Protein sequence
MKKTKPAPPP LPIDYSGFVA LWGALFSFAA FLYFYRHGET LLYGDAVAHI NIARRIFDCR 
EPGLRQLGTV WLPFPHLVMA PFLLNDNFWV SGIGGSLPSM VAFVLGAVGL YRLVAARTAH
WVGGVAAGIY LLNPNLLYMQ STAMGESIYL ALMIWAVFYL DAFARGLRDP EQPLRPAKAL
TRCAMVLAAA ILTRYDGWFF TFIIALAALF ILVRNWSLQS DKQKRLLTRS AIHFTLLCTL
TPALWMSYNY WLSRHPLDFA TGPYSAKAIA ARTTPQGAPP YPGKDHMATA ATYFLKAAKA
NMAEGRWQFW LMVAAVLGSA IAAVVVRGGW LWLLLWTPLP FYALSIAYGS VPIFVPEWWP
FSYYNVRYGM ELIPVFCVSV AFLASLGKRA MLPGRWQIAL PVVVLAIVVG GYYASWRATP
ICLREAQANG RNRMSEDAAV ARYIQMMPPD TTILMQTGSY VGALQMAGRH LDSVVWEGLY
YQWELALNQP AEKADYIIAF GNDEVAQAVK AHPQGLESIV VLRVGDQAPA TIYRSTARNA
RPL