Gene Acid345_2870 details

Gene Information

Locus tag: Acid345_2870
Symbol:
ID: 4070389
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3415163
End bp: 3416368
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 58%
IMG OID: 637984888
Product: hypothetical protein
Protein accession: YP_591945
Protein GI: 94969897
COG category:
COG ID:
TIGRFAM ID:
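
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both assumptions fit the numbers in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3415163, 3416368)
print(length)                     # 1206
print(protein_length_aa(length))  # 401
```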


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCAGCG CCATCTCAAC GGCTCTGCGT CGAGCGTTTG AAGACGCCGC ACCGACCGTC 
CGCGTGACGT GCGAAGAACT GGCGGTTTGG CTGCGGGACC ATTATTACCA AGGCTATGAG
CCCAGCGACC TGCGCTCGGC ACCGTCGTTC CAGAAGCCGT TTTTCAAACA GCCTGGCATT
TCTCACCTCA TCGACTATCT CAGTGAGCAC AGCGGACGGC GCGCGCGAAA GTTTCTGAGC
GTGCCACTGT CGTACACGCC GACGACGCTG GCGCTGGCGC TGGGTGCCTG CGCGGAACTC
TATCCGTTCG ACCCCGGGGT GTTGCCGACG ATCATCGTGC TGCGCGGCGA ACTGCTGCGG
TTGAGGCATC CCAGTGAAAA AGAATATTGC TGGGGCGATG AGTACGATGA TCGTTTTGCG
CGCGATCTGA ATGCGCCCGT GTTTAAGCCG GATACATTCG TCTCGTATCT GTGCGGATCG
GCGATGATGA CGCTCGCCGA AAAGGTACGC GATGAGTCGG CGGTTGCCGT GGCGGAGTCG
GTGGGGCGGT ACTTCATCAC GCGGCTGAAT CGCTCGGTGG ATGAGTCGGA CGAGATGTGC
TTCAGCACTT CGCCTGACGA CCAGGAAAAG ATATTTCACA AGAGCGCGCT GGTCGGCAGC
TTCCTGGCGC GCCTTTGGCA GTGGAACCGC AATGACGAGT ATCTCGAACT CGCAGTGCGG
GCGATGAATT TCCTGCGCAA CGCCCAACTC GCTACCGGCA TGTGGTACTA CGGGCTAGGA
GAAGAGAAGC GCAACATCAA TGCGATCCAT ACGTCGTACA ACGTGATCGC GATGAACGAT
TACCGCTTCT ACTCTGGAGA TCGCAACTTC GACGACACGA TTTTCAACGG CAACGAAGCG
TTTAAGCGAG TGTTCTTCGA GACTGATGGC AAGCCGAAGA TGTTTGTGCA TCGGCTGTAT
CCGGTGGACG TGCGCGCGTG CTCGCAGGCG ATTGAGCACT TTTCGGCAAT GATGAAAGAC
GACATTGATG CCCCAGATCG GGCGATCAGC ATCCTGCAAT GGACGTTGCA GAACATGCGC
AATCCCGACA ACAGCTTTGC GTATCGCAAA TATGCAACCG GCACCCAGCG CATGGCTTAC
GTAGCGTGGG GACAAGCGCA TATGTTTTAT GCGATGTCGC GCTTGCGGAC GGCCTTCGCG
CTTTAG
 
Protein sequence
MSSAISTALR RAFEDAAPTV RVTCEELAVW LRDHYYQGYE PSDLRSAPSF QKPFFKQPGI 
SHLIDYLSEH SGRRARKFLS VPLSYTPTTL ALALGACAEL YPFDPGVLPT IIVLRGELLR
LRHPSEKEYC WGDEYDDRFA RDLNAPVFKP DTFVSYLCGS AMMTLAEKVR DESAVAVAES
VGRYFITRLN RSVDESDEMC FSTSPDDQEK IFHKSALVGS FLARLWQWNR NDEYLELAVR
AMNFLRNAQL ATGMWYYGLG EEKRNINAIH TSYNVIAMND YRFYSGDRNF DDTIFNGNEA
FKRVFFETDG KPKMFVHRLY PVDVRACSQA IEHFSAMMKD DIDAPDRAIS ILQWTLQNMR
NPDNSFAYRK YATGTQRMAY VAWGQAHMFY AMSRLRTAFA L
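
The gene sequence starts with GTG while the protein starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the alternative initiation codons and is read as methionine when it opens a CDS. The sketch below illustrates this on the first 30 bases of the gene. It is an illustration only, not the database's own pipeline: the `translate_cds` helper is hypothetical, and the codon table and start-codon set are written out by hand from the NCBI definition of table 11.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons for translation table 11, per the NCBI definition.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiation codon in first position as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":                   # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGTCCAGCG CCATCTCAAC GGCTCTGCGT"))  # MSSAISTALR
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same helper should reproduce the whole protein, assuming the sequence was copied without errors.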