Gene Apre_0144 details


Gene Information

Locus tag: Apre_0144
Symbol:
ID: 8396895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 170502
End bp: 171503
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 36%
IMG OID: 644994482
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_003151917
Protein GI: 257065661
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4209] ABC-type polysaccharide transport system, permease component
TIGRFAM ID:
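
The coordinate and length fields in this record agree with one another, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 170502
END_BP = 171503
GENE_LENGTH_BP = 1002
PROTEIN_LENGTH_AA = 333


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Here 171503 - 170502 + 1 = 1002 bp, and 1002 / 3 = 334 codons, one of which is the stop codon, leaving 333 amino acids.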


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAATA AAAATCAGGG TCAAGTAAAA GTAATTGACC AAAAGGAATT AGAAAGAAGA 
AAGAAAAAAG CCAAAAAACC ATTTAGCCAA AGATTTAAGA AGAATCTACC TCTTATGGCC
TTCTGTCTAC CAGGCTTTAT TTGGTTTGCC ATAATGAGCT ATTTGCCTAT GTTTGGGGTA
ATTATCCCCT TTAAGGACTA CAAGGTTTTT TCTAAGAATT TCTTCTATAA TCTCTTTCAT
AGCGAATGGA TTGGATTTGA TAACTTCAAA TTCTTCTTCC AAAGCAATGA CGCTTGGGTT
ATCATAAGAA ATACGATTTT GTACAATGCT TGCTTTATAG TTGTAAATAT AGTTCTTGCT
ATGTTTACCG CCATAGCTCT TCATGAGCTT TTAAATAGAA AAGCTGCGAA GTTCTATCAG
ACCTCGCTCT TTCTGCCATA TTTCTTATCT TGGGTTGTAA TCTCTTATGC AGTATTTGCC
TTTCTATCAC CAGATAAGGG CTTGATAAAT TCTCTTATTA TGAAATTCGG TGGAAGAAGT
AAGAATTGGT ACACAACCAA GACTTGGTGG CCAGTATTTT TGGTAATCAT TAATGCTTGG
AAAGGCCTAG GTTATAATAC AGTAGTTTAC CTATCAGCCC TATCAGGAAT TGATAAGACC
TACTACGAGG CGGCTGTAAT GGACGGAGCA AGTAAGTGGG AACAGATAAA ATATATCACT
ATACCAATGC TAAAGCCTGT TGTTACTGTT TTATTTATAA TGAGTCTTGC TAATATATTT
AGGGCAGACT TTGGTTTGTT CTACCAAGTT CCAAGAGATT CGGGACCTTT GTATTCAGTT
ACAAACGTAA TCGATACCTA TGTATTTAGG GCCCTTATGA AAAATGGAGA CATAGGCCTA
TCATCAGCCG TATCGCTTCT CCAATCAGCA GTTGGAGCAG TCCTAATAAT AGGTGCCAAT
AAGATTGTCA AAAAATATGA TCCACAAAGA TCACTTTTCT AG
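
The GC content reported for this gene (36%) can in principle be recomputed from the gene sequence above. A minimal sketch, assuming the sequence is pasted as text with spaces and line breaks between 10-base blocks; `clean` and `gc_fraction` are hypothetical helper names, not part of the record.

```python
def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Usage example on the first row of the gene sequence only; pass the full
# sequence to compare against the GC content field of the record.
first_row = "ATGGAAAATA AAAATCAGGG TCAAGTAAAA GTAATTGACC AAAAGGAATT AGAAAGAAGA"
print(f"{gc_fraction(first_row):.0%}")
```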
 
Protein sequence
MENKNQGQVK VIDQKELERR KKKAKKPFSQ RFKKNLPLMA FCLPGFIWFA IMSYLPMFGV 
IIPFKDYKVF SKNFFYNLFH SEWIGFDNFK FFFQSNDAWV IIRNTILYNA CFIVVNIVLA
MFTAIALHEL LNRKAAKFYQ TSLFLPYFLS WVVISYAVFA FLSPDKGLIN SLIMKFGGRS
KNWYTTKTWW PVFLVIINAW KGLGYNTVVY LSALSGIDKT YYEAAVMDGA SKWEQIKYIT
IPMLKPVVTV LFIMSLANIF RADFGLFYQV PRDSGPLYSV TNVIDTYVFR ALMKNGDIGL
SSAVSLLQSA VGAVLIIGAN KIVKKYDPQR SLF
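
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that under stated assumptions: it uses the codon assignments of NCBI translation table 11, which match the standard code for all 64 codons (the tables differ only in permitted start codons), and it stops at the first stop codon. The name `translate` is illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final row of the gene sequence in this record.
print(translate("ATGGAAAATA AAAATCAGGG TCAAGTAAAA"))                # MENKNQGQVK
print(translate("AAGATTGTCA AAAAATATGA TCCACAAAGA TCACTTTTCT AG"))  # KIVKKYDPQRSLF
```

Both fragments match the start and end of the protein sequence listed above; the final row is in frame because the 16 preceding rows contribute 960 bases, a multiple of 3.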