Gene Acid345_2311 details

Gene Information

Locus tag: Acid345_2311
Symbol:
ID: 4071465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2738896
End bp: 2740326
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 59%
IMG OID: 637984327
Product: hypothetical protein
Protein accession: YP_591386
Protein GI: 94969338
COG category:
COG ID:
TIGRFAM ID:
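
The coordinates and lengths listed above agree with each other, which a few lines of arithmetic can confirm. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2738896
end_bp = 2740326
gene_length_bp = 1431
protein_length_aa = 476

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches the listed gene length
print(span_bp % 3 == 0)                          # span is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches protein length
```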


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.466127
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.419829
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCAA GAATATTGGT GGGCATCATG CTGTCGCTCG CAGCGGCGTG CCTGGCAATC 
TTGATCGCAT GCGGTGGCAG CAGCAGCATG AACTCCAACA AGACGACGGG GACAGTCAAC
CTCTCAGTCA GTGATCCGCC CACGTGTGCG GCCCCAGCCG GCCCTTATTC CAACGTCTGG
GTGACGATCA AAGACGTGCA AATTCACCAG AGCGCGTCCG CGGGACCAAG TGACGCGGGC
TGGGTGGATC TCACGCCGAA CCTCAAATCA GCACCGCAGC AGGTGGATCT GCTCGGGATC
GCCGGCAACA ATTGTTTCCT CGCGATGCTT GGTTCAAACG TCGAACTGCA AGCCGGGAGC
TACCAGCAAA TTCGAATCTA TCTCTCCGAC AGTTCCGACG CCAGCAAACT CACGACGAAC
CATTGCAGTG GATCCGACGT GAATTGCGTT GTCACTGGCG GAAACACTTT CACTCTTGAG
CTCTCCAGCG AATCGAATAC CGGCATCAAG ATTCCATCCG GACAACTGGC AGGCGGCAAC
TTTACGATTG CGGCCGGAGA AGTGAAGGAC CTCAACATCG ACTTCGACGC CTGTCTCTCG
ATCGTGCATC AAGGCAATGG TAAATATCGG CTTAAGCCCG TGCTGCATGC CGGAGAAGTT
CAACTGACAT CCTCCTCGGT TACGGGTTCG CTTGTAGATA GCATCTCGCA TACGTCCATC
GTTGGTGGTG CGGCGGTGGT TGGGCTGGAG CAGAAGGACG CGAACGGGAT CGACCGCGTC
ATCATGCAGA CGGTTACGGA TGCCCGCGGC AACTTCGTTT TTTGCCCCGT GCCAGCCGGA
ACGTACGACG TTGTAGCCGT GGCAGTAAAT GGAGCCGGAG TGGCCTACGC TGCCACGATC
ACGACTGGCG TCCAACCCGG GAATGCTTTA GGAAATGTTC CGATGGTGGC TCAGGTAGGA
GTTCCACTCA CCAACGCGGA AATTGATGGG GAAATTACTT CGAGCACGGG GAGCGCAGCG
GCAGCGGCGG ATATAACGTT CTTCGCGATG CAATCGGTCT CTATCGAGGG CTCGACGGTG
AACGTGATTA TTCCACTGGC ACAGCAATGG AGCTCGGCAA CCGCGTCCAT GACCACGGAT
CCGACGTCTG CGTGCGCGAC GGCGACGGCC GCATGCGTCG CCTACCAGGT GTTCTTGCCG
GCGATGTGGC CGAATGTTGG TGCATACGCT GCCTCCGGCG CGACTTACAC CCAGAACAGC
GCGACGCCAG TAACGTACGC GATCGGCGCC GATGCGTTTA TTCCGGGATC GGCAGGAACG
TCCGACTGCA CGCCGCCGGG TGAGATCACG ACCACCGGCG GTACGCCGAT GACCGTGTCG
CCAGGATCGC CGACCCCCGC CCCGACATTG GCGTTCACCG GGTGCCAGTA G
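
The GC content field in the record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text with the spaces and line breaks used for display. The function name `gc_percent` is hypothetical, and how the database rounds its reported value is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks that are only there for display.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First ten-base display block of the gene sequence above:
# one G and two C out of ten bases.
print(gc_percent("ATGAAACCAA"))  # 30.0
```

Passing the full gene sequence instead of the first block would give a figure to compare against the reported GC content.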
 
Protein sequence
MKPRILVGIM LSLAAACLAI LIACGGSSSM NSNKTTGTVN LSVSDPPTCA APAGPYSNVW 
VTIKDVQIHQ SASAGPSDAG WVDLTPNLKS APQQVDLLGI AGNNCFLAML GSNVELQAGS
YQQIRIYLSD SSDASKLTTN HCSGSDVNCV VTGGNTFTLE LSSESNTGIK IPSGQLAGGN
FTIAAGEVKD LNIDFDACLS IVHQGNGKYR LKPVLHAGEV QLTSSSVTGS LVDSISHTSI
VGGAAVVGLE QKDANGIDRV IMQTVTDARG NFVFCPVPAG TYDVVAVAVN GAGVAYAATI
TTGVQPGNAL GNVPMVAQVG VPLTNAEIDG EITSSTGSAA AAADITFFAM QSVSIEGSTV
NVIIPLAQQW SSATASMTTD PTSACATATA ACVAYQVFLP AMWPNVGAYA ASGATYTQNS
ATPVTYAIGA DAFIPGSAGT SDCTPPGEIT TTGGTPMTVS PGSPTPAPTL AFTGCQ
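
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is illustrative, not the database's own method. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares. Table 11 differs in the set of permitted start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and ambiguous bases such as N would raise a `KeyError`.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids (standard assignments,
# shared by translation table 11). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    # Drop the spaces and line breaks that are only there for display.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First thirty bases of the gene sequence above.
print(translate("ATGAAACCAA GAATATTGGT GGGCATCATG"))  # MKPRILVGIM
```

The output matches the first ten residues of the protein sequence. The last twelve bases of the gene, `GGGTGCCAGTAG`, translate to `GCQ` followed by a stop, which matches the end of the protein.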