Gene Acid345_1536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1536 
Symbol 
ID4072927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1875160 
End bp1876590 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content56% 
IMG OID637983545 
Productamino acid transporter 
Protein accessionYP_590612 
Protein GI94968564 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0652543 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.377188 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGAACA GCCAGACCGA AACCCCGCAC CTTAAGCGCG TCCTTGGAAA ATGGGACCTG 
GTGCTGCTGT TCGTGGTTGC CGTTTTCAAT CTGAATGTGG TTCCTTCGAT TGCCGCCAAC
GGCGGGGTCA CGATCTGGCT CTGGCTTATT TCACTGGTGC TCTTCTTTTG GCCGCAAGGC
ATCGCGGTCA TCGAACTCGC TCACCGCTAC CCCGGCGAGG GCGGCGTGTA TCTGTGGGCA
AAGGAGGTGT TCGGCGACTT CCACGGCTTT CTTTCCGGCT GGTGCTACTG GACCAACAAC
ATGCTCTACG TTCCGACGGT GATGCTCTAC TTCGTCGGCG TGTCTGTGTA TGTACTTGGC
CCTTCACACC AGGGTCTTGC GGATAACAAG GTTTTCGCGC TAAGCGCATC TTTAGTGCTG
CTGATCGTGC TGGTTGTGCT CAACGTTATC GGGCTTGGCG TTGGAAAGTG GATCAACAAC
ATCGGCGCGA TCGGAACCTT CATCGCCGCC GCAACACTTC TGATCCTTGG ACTTGTTGTC
GGATTGAAGC AGGGCGCTTC GATTCACTAC GTAAACTTCG CCATCCCGAC CGATCGAAGA
TTTCTGTTTA ACGCCTTTGG CGTGATCTGC TTTGGATTGG TCGGACTCGA ACTCGCCTCG
ATTATGGGCG ACGAGATTCA GGAACCTCGT AAAACGCTAC CAGGCGCAGT GTTGTGGGGC
GGAGTCATCT CCGGCTTGCT CTATATATCG GTGACGCTCA CCCTGCTCGT TGCTGCAGGG
AAAAACGGTA TCAGCGTTCT GCAAGGGATC GTGCAGGCCG TCGGACATCT TGCTGAGCAA
GCACACGTGA CCTGGATTAT CGTCCCATTT GCATTGATGC TCAGCCTGGC GATTGCCGGC
ATTGGATCCG CATGGCTCGC TGGATCAGCG CGAATTCCCT TCGTTGCCGG CCTCGACAAC
TACATGCCGA GCTGGCTCGG ACGCGTGCAT CCGAAGTACG CCACGCCGTA CGCAGCTCTG
ATCGTGCATG CTGTGATCTC ACTGTTGCTG GTACTCGTGA ACTTCCTCGG CGGCGCCGGC
GTACAGGAAA CGTTTCAAAC GATGCTTTCG CTCGCAGTCG TCCTGCAACT CGTTCCCTTC
CTTTATATGT TTGGCGCGCT GGTGCGCTTC GGGTTGAAGT ACGAAGCGGG TAAGGGCGTT
TACGGGCGTC CAGCGCTGTT GCTCTCCGGC ATCAGTGGAT TTATTACGAC TACGCTCGGA
ATTGCGCTCG CCTTTTTCCC GGCGCAAACC ATCAAATCTG TATCGCAGTA CGAATTTAAA
ATGTTTGGCG GCACAGCGTT CTTCATTGGG CTTGCCGCGT TCTTCTTCTT TATTTATGGC
GGCCGCAAGG CCCGCCTCGC TGCGCAATCC GCAGCCAACC AAGCTGCCTA G
 
Protein sequence
MTNSQTETPH LKRVLGKWDL VLLFVVAVFN LNVVPSIAAN GGVTIWLWLI SLVLFFWPQG 
IAVIELAHRY PGEGGVYLWA KEVFGDFHGF LSGWCYWTNN MLYVPTVMLY FVGVSVYVLG
PSHQGLADNK VFALSASLVL LIVLVVLNVI GLGVGKWINN IGAIGTFIAA ATLLILGLVV
GLKQGASIHY VNFAIPTDRR FLFNAFGVIC FGLVGLELAS IMGDEIQEPR KTLPGAVLWG
GVISGLLYIS VTLTLLVAAG KNGISVLQGI VQAVGHLAEQ AHVTWIIVPF ALMLSLAIAG
IGSAWLAGSA RIPFVAGLDN YMPSWLGRVH PKYATPYAAL IVHAVISLLL VLVNFLGGAG
VQETFQTMLS LAVVLQLVPF LYMFGALVRF GLKYEAGKGV YGRPALLLSG ISGFITTTLG
IALAFFPAQT IKSVSQYEFK MFGGTAFFIG LAAFFFFIYG GRKARLAAQS AANQAA