Gene Acid345_0860 details

Gene Information

Locus tag: Acid345_0860
Symbol:
ID: 4068954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1071545
End bp: 1072828
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 59%
IMG OID: 637982869
Product: hypothetical protein
Protein accession: YP_589939
Protein GI: 94967891
COG category:
COG ID:
TIGRFAM ID:
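
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the function names are illustrative and are not part of the database.

```python
# Coordinates copied from the record above.
start_bp = 1071545
end_bp = 1072828


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Number of residues, assuming the last codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1284 427
```

Both results agree with the Gene Length (1284 bp) and Protein Length (427 aa) fields of the record.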


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCAATG CCCTTGCGCT CGCCGCGGTC ACGGCTGTGC TCCAGTCGTA TCTGAACGCT 
GTGTACAACA ATCCGTCATC GGTCTTGGGC AGCGTCTCCG TGACCGCTAT TGCTCCTGAC
CTCATTCAGG GCGGTATTGC CGGCGGCGGC AACGCGCCTC TCCAGGTAAA TATCTTTCTC
CACCAAGTCA CGCTAAACGC CGCGTGGCGA AACATTGAGA TGCCAACCCT TGCGCCGGAC
GGTCAAACCC GCATTGCGAA TCAACCCCTC GCGCTGGACC TCCACTATCT TCTGACCGCG
TATGCGCCCG AAGATAGCCA GGCCGAAGCC TTGCTTGGTC TTGGCGTCTT CTTCTTGCAC
CAAAATCCGA TGATTGCCCG CGCAGATATC GCTTCGGCGC TAGCAGCCCT TCCACCGAGC
TATCCAGCTC CATTCGCTAC CGCGCTCGGT CTCTCGGGAC TTGCCGACCA GGTCGAAATG
ATCAAGATCA CTCCCGCCAC TCTTGGTCGC GAGGAGATCG CGTGGCTCTG GACCGCCCTC
AAGGCCGACT ACCGCCCGAC GTTTCCCTTT CAGGTATCCG TGGTCCTGAT CCAGCCGCAG
AATCCAGTAT TCGCCGCTTT ACCCGTACTA CAACGGATTA TCGAAGCGAA GCCGCTGTCT
CCAATTCCAA CGTTGACCGA AGCTGATCCG CCAAACAAAC AGCCTGTCGC ATGTCTCGGA
GATACGGTCA CCGTTCAAGG CGCATTCCTG AACGGAACCT CCGCCGTACG GTTGGTCAAT
CCACAGCAGG GTCTTCAGTC GGATATCACC GCCATTACGA ATGCCACGAA TGTGTCTTTT
AAGTTTGGTA TTCCTAACCC CGTGCTACCG TCCCCACAAC TTCATCCCAC GGACCTCCCC
GCAGGCGTTT ACGTGGTCTC CGCCAAGGTC GCATCGGATG GCGACACAGT GGACACCAAT
GGCGTTGCCC TCGCGATTGC GCCGAAAATC GATGCGTCTT GGGCGCCCGG AACGATCCCA
TCAGGTCTAA ACGTTTCCGT CTCCGTGCCA TGCGCACCCT ATCTCCGCCC TGGGCAGGCT
GTTCAACTCC TTATCGGAAG CCAGGCGGCT CCAGCCGACA CCTTCGATAC TCCAACCAAT
TCTCCGAGCT TCACCTTTGC CAACCTCACC GCCACTGCCA CACCCGTTCC AGTGCGGCTC
CGCGTCGACG GCATCGACAG TCCAATCATC GACATGACGG CGAAGCCTCC GAAATTTACC
GGCCCGTCCG TGCAGGTGAC GTAA
 
Protein sequence
MSNALALAAV TAVLQSYLNA VYNNPSSVLG SVSVTAIAPD LIQGGIAGGG NAPLQVNIFL 
HQVTLNAAWR NIEMPTLAPD GQTRIANQPL ALDLHYLLTA YAPEDSQAEA LLGLGVFFLH
QNPMIARADI ASALAALPPS YPAPFATALG LSGLADQVEM IKITPATLGR EEIAWLWTAL
KADYRPTFPF QVSVVLIQPQ NPVFAALPVL QRIIEAKPLS PIPTLTEADP PNKQPVACLG
DTVTVQGAFL NGTSAVRLVN PQQGLQSDIT AITNATNVSF KFGIPNPVLP SPQLHPTDLP
AGVYVVSAKV ASDGDTVDTN GVALAIAPKI DASWAPGTIP SGLNVSVSVP CAPYLRPGQA
VQLLIGSQAA PADTFDTPTN SPSFTFANLT ATATPVPVRL RVDGIDSPII DMTAKPPKFT
GPSVQVT
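
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own pipeline. It assumes that translation table 11 assigns amino acids as the standard genetic code does and that an alternative initiator such as GTG in the first position is read as methionine. The `START_CODONS` set is a deliberately small subset chosen for this example, and the name `translate` is illustrative.

```python
from itertools import product

# Standard codon-to-amino-acid mapping, in TCAG order.
# Assumption: table 11 uses these assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small subset of initiator codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a DNA string; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on non-ACGT symbols
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "GTGAGCAATG CCCTTGCGCT CGCCGCGGTC ACGGCTGTGC TCCAGTCGTA TCTGAACGCT"
print(translate(first_line))  # MSNALALAAVTAVLQSYLNA
```

The output matches the first 20 residues of the listed protein sequence. Passing the full gene sequence to `translate` gives a string that can be compared with the complete protein sequence in the same way.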