Gene Acid345_3797 details

Gene Information

Locus tag: Acid345_3797
Symbol:
ID: 4071081
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4486736
End bp: 4488151
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 52%
IMG OID: 637985820
Product: hypothetical protein
Protein accession: YP_592871
Protein GI: 94970823
COG category:
COG ID:
TIGRFAM ID:
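
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes a single stop codon (the record does not state either convention). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(4486736, 4488151)
print(length, protein_length_aa(length))  # prints "1416 471"
```

Under these assumptions the computed values agree with the listed Gene Length (1416 bp) and Protein Length (471 aa).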


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.533548
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCGAAGA ATGTTTTGAT ATTCCTGCAA GTGACCGTAC TCGGATTCGC ATCCGCGGCT 
TTCGCCGCGA ATGCCTCAAG TGTCGTTCGC CTGAAACCTG ATCAGGATTT CGCGTCCATC
ATCAAGAATG CCCCGGCTGG CTCGCAGTTT GAGTTCGCGG CGGGTGACTA TCGCATGGCT
TCGATCACCC CGAAAACTGG CGATTCGTTT CGTGGAAACG GGCAGGCTGT TCTGAATGGG
GCGAAGCTGG TCACGTTTAG GCAGGACGGG AAACTTTGGA GCATCAGTGA GCAGTTGGGT
CGCTCGAGAA ACGGATCCTG CGAGCCGTCG CGCCCAGCAT GCCTGATCTT GAACGATCTC
TTTATTGACG ACAAACTGCA GACCCTGGTG CTTGATAGAT CTCAACTAAC GGTTGGAACC
TGGTACTACG ACCAGGCGTC TTTGAAAGCG TACATCAGTG TGGATCCGAC CGGCCATAAA
GTAGAGTTAG GATCCGCGCC GCTTGCTTTT GCGGGATCGG CGACCGATGT GACGATTGAC
GGCTTTACTG TCGAAAAATA TGCTAATTCG CCACAGACTG GGGCGGTCGG TGGGTATAAC
GGCAGCGCAC ACTCGTGGAT AATTCGTCAC GTTGAGACGC GTTGGAATCA CGGTGTAGGC
ATTGCAGTGG GCAGTAACAG CATAATTCAG TCAAGTAATT CTCATCATAA TGGCCAACTC
GGAATGGCTG CACACGGCGA GAATATCCAG ATTTTGGATA ATACGATCTC GAACAACAAC
TATGCGGGTT TCAAAATCGT TTGGGAGGCG GGTGGAACCA AATTCTCTGG CTCTGACCAC
CTTTTGGTTC GTGGGAATGT TGTTGAAGCG AACTACGGTA ACGGGCTTTG GACCGACATC
GATAACATCC ACGTAGTCTA CGAAAAAAAC AGAGTTCTCA ACAATACGGG CGCCGGAATT
GTGCATGAAA TTAGCTATGA TGCTGTGATC CGCAACAATT TCGTGTCCGG CAATCGAGTC
GGAATCATCA TCATTCTTTC TTCGAATGTA CAAGCTTATG GCAACGTCGT TGAGGTGCCT
CCGAACGGTA CGGACGCCAT ACGAGTTGCG AATGGCAACC GCGGCGAAGG GAAATTTGGT
CCATATGTCG CCCACGATAT TCGGGTGTAT GACAACATCA TTACGTTCCT GGGATCGAGT
GGGCGCAGCG GACTTAGTGG GCCATTGGAT ACGGCGAGAA ACGTCGTTTT CGAAAATAAC
CAATATCACC TGCTCGGTGG TGGAAACGCT CACTGGATAT GGGGATCTCC CAATCCAGTG
CCATTGAGTG AAGTGCAACG TGTCGGCTCG GACAAAGGGG CAAAGGTCTC ACGAGAACCT
GCGAAGATGA TCGATCCAAC GCGATCCCCA GAGTAG
 
Protein sequence
MAKNVLIFLQ VTVLGFASAA FAANASSVVR LKPDQDFASI IKNAPAGSQF EFAAGDYRMA 
SITPKTGDSF RGNGQAVLNG AKLVTFRQDG KLWSISEQLG RSRNGSCEPS RPACLILNDL
FIDDKLQTLV LDRSQLTVGT WYYDQASLKA YISVDPTGHK VELGSAPLAF AGSATDVTID
GFTVEKYANS PQTGAVGGYN GSAHSWIIRH VETRWNHGVG IAVGSNSIIQ SSNSHHNGQL
GMAAHGENIQ ILDNTISNNN YAGFKIVWEA GGTKFSGSDH LLVRGNVVEA NYGNGLWTDI
DNIHVVYEKN RVLNNTGAGI VHEISYDAVI RNNFVSGNRV GIIIILSSNV QAYGNVVEVP
PNGTDAIRVA NGNRGEGKFG PYVAHDIRVY DNIITFLGSS GRSGLSGPLD TARNVVFENN
QYHLLGGGNA HWIWGSPNPV PLSEVQRVGS DKGAKVSREP AKMIDPTRSP E
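
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG can act as a start codon, and an initiating codon is read as methionine. The sketch below illustrates this on the first and last stretches of the gene sequence. The helper name `translate_cds` is hypothetical, and the start-codon set is only the commonly used subset of table 11's start codons, which is an assumption made for brevity.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); the amino-acid
# assignments of table 11 are the same as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only a subset of the table 11 start codons is listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("TTGGCGAAGA ATGTTTTGAT ATTCCTGCAA"))  # MAKNVLIFLQ
# Last 36 bases of the gene sequence, ending in the TAG stop codon.
print(translate_cds("GCGAAGATGA TCGATCCAAC GCGATCCCCA GAGTAG"))  # AKMIDPTRSPE
```

Both fragments match the corresponding ends of the listed protein sequence.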