Gene Apar_1335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1335 
Symbol 
ID8414220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1501196 
End bp1502689 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content51% 
IMG OID645022932 
ProductPTS system, trehalose-specific IIBC subunit 
Protein accessionYP_003180350 
Protein GI257785133 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR01992] PTS system, trehalose-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.135548 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00345051 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAAAGT TCGACCATGA TGCTCGTGAA CTCCTCGAGC TCGTTGGCGG CAAAGACAAC 
ATTGCTGCGG CTTCACACTG TATGACACGC ATGCGATTTG CCTTGAAAGA TCCTTCAAAA
GCGGATGTGG CTGCCATTGA AAAGCTTGCA TCCGTTAAAG GAAGTTTCAC GCAAGCCGGC
CAGTTTCAAG TTATTATCGG CAATGACGTC GCCGACTTTT ACGACACCTT TGTCGGTATT
TCTGGCGTAA GTGAGGCTTC AAAACAAGAC GTCAAATCAG CCGCCCTTCA AAACACAAAC
ATTTTGCAGC GAGCCATGGG AGCCATTGCC GAGGTCTTTG CTCCCCTGAT TCCCGCAATC
ATTACCGGCG GCCTTATTCT TGGCTTCCGA AACGTGCTTG GCGAGATGCC CTACTTTGGA
CCTGAAGGCA ACCAGACCCT CGCTTCGCTT TCCGTCTTCT GGACGGGCGT ATATAACTTC
TTGTGGCTTA TTGGCGAGGC AGTCTTCCAT GCCGGCATCC CTGTAGGCAT TTGCTGGTCA
ATCACAAAGA AAATGGGCGG GACCCCCATG CTCGGCATTG TGCTTGGCCT CACCCTCGTT
TCCGGCCAGC TCATGAATGC CTATGCTGTT TCAGGAGCAA CCGCCGCTGA TTGGGCTACC
CATACATGGA ATTTTGACTT CGCGCAGGTC CGCATGATTG GATATCAGGC TCAAGTTATC
CCCGCTATTC TCGCTGCGAT TACCTTCAAC TACCTCGAGC GATTCTTTAA GAAAATCACG
CCATCCGTCA TTCAGATGAT CGTAGTGCCC TTCTGCTCAC TTTTGCTTGC CGTTATGGCT
GCTCACTTCG TGCTTGGCCC CATTGGCTGG ACCATCGGAT CTTGGATCGG AAACATTGTT
CTCGCAGGCA TCACAAGTCC GTTTGCATGG CTCTTTGGCC TTATCTTTGG CGCCGTATAT
GCTCCCCTTG TCATCACTGG CTTGCACCAC ATGTCCAATG CCGTTGACAT GCAGCTTATT
GCAAGTTCCA ATAACTATGG CACTCCACTG TGGCCCATGA TCGCACTTTC CAATATTGCT
CAGGGTTCAT CGGTTCTCGC CATGTCTGTC CTTCAGAAGC ATGACGAAAA TGCTCAGCAG
GTAAACATCC CTTCCATCAT CTCTTGCTAC CTTGGCGTTA CCGAACCCGC TATGTTTGGC
GTCAACCTCA AATACGGTTT CCCCTTCGTA TGCGCTATGA TTGGCAGTTC CATTGCAGGA
GGTGTCTGTA CGGCCTTTGG TGTCAACGCT CTCTCCATTG GCGTTGGTGG ACTCCCCGGA
ATTCTTTCAA TTCGCCCCGA GTTTATCGGC ATCTTCGCCG TTTGCATGGC CATCGCTGTA
GTGGTTCCAT TTGTTCTTAC ATTCACCATC GGCAAACGCC AAGGTATTGA TAAAGGTATC
GATACCAATG TCGTCACGCT TGATGGCGAG AAAAACAGTG CCCTTTCGGC CTAA
 
Protein sequence
MAKFDHDARE LLELVGGKDN IAAASHCMTR MRFALKDPSK ADVAAIEKLA SVKGSFTQAG 
QFQVIIGNDV ADFYDTFVGI SGVSEASKQD VKSAALQNTN ILQRAMGAIA EVFAPLIPAI
ITGGLILGFR NVLGEMPYFG PEGNQTLASL SVFWTGVYNF LWLIGEAVFH AGIPVGICWS
ITKKMGGTPM LGIVLGLTLV SGQLMNAYAV SGATAADWAT HTWNFDFAQV RMIGYQAQVI
PAILAAITFN YLERFFKKIT PSVIQMIVVP FCSLLLAVMA AHFVLGPIGW TIGSWIGNIV
LAGITSPFAW LFGLIFGAVY APLVITGLHH MSNAVDMQLI ASSNNYGTPL WPMIALSNIA
QGSSVLAMSV LQKHDENAQQ VNIPSIISCY LGVTEPAMFG VNLKYGFPFV CAMIGSSIAG
GVCTAFGVNA LSIGVGGLPG ILSIRPEFIG IFAVCMAIAV VVPFVLTFTI GKRQGIDKGI
DTNVVTLDGE KNSALSA