Gene Acid345_2573 details


Gene Information

Locus tag: Acid345_2573
Symbol:
ID: 4070536
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3036332
End bp: 3037867
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 56%
IMG OID: 637984590
Product: Cna B-type protein
Protein accession: YP_591648
Protein GI: 94969600
COG category:
COG ID:
TIGRFAM ID:
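
The coordinate and length fields in this record agree with one another, and a short sketch can show how. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3036332, 3037867)
print(length)                  # 1536
print(protein_length(length))  # 511
```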


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATTGC ATCCGATTCT CCCCTTTCGA AAAGTTGTAC TTCTCGTCGT CGCAGCCCTT 
CTTCTCCCTA CGCTCGTGTT CGCGCAGGCT TACTTCGGTA CAGTAAGCGG AAGCATCGTC
GATACCAGTG GGGCCGTCGT GCCCGGCGTC AATGTGACGC TGACTGACAT GCAAAAGGGC
TTCACATTCC ATGCGCAATC GGGCAGCGAT GGACACTACC TCTTCCGCTC CATTCCCCCG
GGGGTTTATC GGGTTTCAAC CGAGGCTACG GGCTTCGAGA AAGCGACCAG CACCAACGTC
AAAGTTGATA TCAACGAGAA CGCCACAGCG AACCTGACTC TCAAGGTGGG CACGACCACC
CAGACCGTGG ATGTTGCCGG GAATGCTCAG AAGATTGAGA CCGAGGACGC GGAAACCGGC
CAGGTCATCA ATCGGAAATT CATCAACGAT CTGCCCCTCA TTAGCCGCTA CGTGATGGAC
TTGACCTACT TAGCTCCCGG CGTGGCCGAC ATGGACGACC AATGCCCGAA CTGCGGTGGA
ACGAATTTCG TTTCTAACGG GAGCCGTGGT GCTTCGGCCG ACATCTTGCT AGACGGTGCC
TCCACCACGA ATTTCGAACC GAACGGTGGC GTGACGCAAG CAACGTACTC GCCGTCTCCG
GAAGCGGTAG AAGAGTTCAA GGTCGAGCAA TCGAATTTCA GCGCGGAATA CGGATTCTCA
GGAGCGAGCG TCATCAACAT GGTGACGCGC TCTGGGACCA ACAAGTTTCA CGGCAGCGTG
TACGACTATC TGCGTAACCA GGTGCTGGAC GCCAACAACT GGTTCAGCAA TTACTACGGG
GATCCCATAC CGGCGCTGAA GCGGAACAAC TATGGAGTCA CCATCGGCGG ACCGATCATT
AAGAACAGGA CCTTCTTCTT TTTCGATTAC GACGGTTTCC GCGAATCATC CGCGAGTTCG
GCAACGGCGG GTGTTCCCAC CGACGCCATG CGGGCCGGCG ATTTCGGCGA GGTATGCAGT
GAAAAGGGCG GCAGCTTTGA CTCCCACGGC ATCTGCAGCG TTACGGCCGG ACAAATCTAC
GATCCCTATC AAGGTGTGTA CGATCCCGGC AACGGTGGAA CCAACCGCAA TGTCGCCATT
CCGTACAACA ACCTGGCCAC TTACGCGAGC CCGGGAAACG CGGCGTTGAT TGGCAGTCCG
TATCAACTTC CGGTTCATCC CGGAAACCTG ATCGATCCGG TAGCGCAAAA GATGATGGGC
CTCTTCCCGA AGCCGAATAT TTCGGGTGGA TCGATTTACC AGAACTGGTA TAGCTCAGGT
GCTTCACAGG GATTCAACGA TCAACTCGAC TTCAAGATCG ACCACCGTGT CTCGGAGAAG
AACCTTCTCA GCGGTAAGTA CTCGCACCAC TGGAACCACA ACGCTGGGTT CAACTGCTTC
AAGAATTTCA TTGATCCTTG CCAGGGCGGC CCCAACGAGT CAAGCGCAAA CCTGTTCGCG
ATCATGACAC CCATACGTTC AGCCCGACAT TCCTGA
 
Protein sequence
MALHPILPFR KVVLLVVAAL LLPTLVFAQA YFGTVSGSIV DTSGAVVPGV NVTLTDMQKG 
FTFHAQSGSD GHYLFRSIPP GVYRVSTEAT GFEKATSTNV KVDINENATA NLTLKVGTTT
QTVDVAGNAQ KIETEDAETG QVINRKFIND LPLISRYVMD LTYLAPGVAD MDDQCPNCGG
TNFVSNGSRG ASADILLDGA STTNFEPNGG VTQATYSPSP EAVEEFKVEQ SNFSAEYGFS
GASVINMVTR SGTNKFHGSV YDYLRNQVLD ANNWFSNYYG DPIPALKRNN YGVTIGGPII
KNRTFFFFDY DGFRESSASS ATAGVPTDAM RAGDFGEVCS EKGGSFDSHG ICSVTAGQIY
DPYQGVYDPG NGGTNRNVAI PYNNLATYAS PGNAALIGSP YQLPVHPGNL IDPVAQKMMG
LFPKPNISGG SIYQNWYSSG ASQGFNDQLD FKIDHRVSEK NLLSGKYSHH WNHNAGFNCF
KNFIDPCQGG PNESSANLFA IMTPIRSARH S
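
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is minimal and makes some assumptions: the sequence is read in frame from its first base, whitespace in the 10-base blocks is ignored, and the codon assignments are those of the standard code, which translation table 11 shares for non-initiator codons. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGGCATTGC ATCCGATTCT CCCCTTTCGA"))  # MALHPILPFR
```

Passing the full gene sequence to `translate` and `gc_fraction` should allow comparison with the protein sequence and the GC content given in the record.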