Gene Pcal_1229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPcal_1229 
Symbol 
ID4910158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum calidifontis JCM 11548 
KingdomArchaea 
Replicon accessionNC_009073 
Strand
Start bp1143441 
End bp1144505 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content57% 
IMG OID640124983 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001056120 
Protein GI126459842 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.227271 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCC TCGACAGAGC CGCGGACCTA TTGATATCGC TGTTGGTGAA GCTCATCACG 
CTGGTTAGGA GAGACTGGTA TGCGAAGAAT AGGGCGAGGG TGGAGGAGTG GCGTCTAACT
CTGTACGCGC TTAATAGATC GCCCACGGGG GTCGCGGGCC TCATACTCTC CATGGGGTTT
GTAGTCGTCG GAATCGTCGG CCCCTTTGTG GCCCCCTACG GCTACGACCA GTTCCTCTAC
TTAGAAAACC CGGACCTGTA TCTAGCCCCT CCTGGGTCCT ACGGCATGTT GCTGGGCACG
GACATCTACG GGAGAGACGT TCTCAGCCTC ATGCTCTACG GGGCGAGGGT CTCGCTTGTG
ATTTCTGTGG TTACAATCGC CCTGGGTGTG CCTCTGGGGA TTTTGCTGGG CCTCATCGCC
GGCTACTACG GCGGGAAGAT AGACGAGGCT ATCATGAGGG TGACAGACAT GTTCCTCGCC
TTCCCAGCGC TTGTCCTCGC GCTTGCGCTC GCCGCGACTC TGCCCCAGAG GATTAGGGAG
GCGTTGGTGG AGAACCAAGC CTTTGCATAC GCCATGGCTG CGATCTTCGG CGTAAAGCCC
GACGACGCTA TCCACCTCGC GCCTCTCATC TCCATCTTCA CAGCATTGAT AATTGTGTGG
TGGCCCACCT ACGCGAGAGT CGTTAGAGGA ATGGTTTTAG TAGAGCGTGA GAAGACCTAC
GTGGAGGCGG CTAAGGCGCT GGGGTACTCC TCTTGGAGGA TTATGACGAG GCACATTTTG
CCCAACATAA TGTCCCCAGT GGTTGTGTTA ATAACCTTCG ACTTCGCCTC GGTGAACTTG
CTCGCGGCGG GGCTAAGCTT TTTAGGCCTC GGCGCGCAGC CCCCCATAGT GGATTGGGGC
TCTCTCATAA ACATGGGCGG TAGCCGCTTC CCCACTGCGT GGTGGCTTGT GTTCTTCCCA
GGTGTCGCCA TTTTCCTGAC GGCACTGGGG TGGAACCTCC TTGGGGACGC TCTACGCGAC
GTGTTTGACC CCAAGTTTAG GAGGAGGATA GAGTTTAGGG TATGA
 
Protein sequence
MKILDRAADL LISLLVKLIT LVRRDWYAKN RARVEEWRLT LYALNRSPTG VAGLILSMGF 
VVVGIVGPFV APYGYDQFLY LENPDLYLAP PGSYGMLLGT DIYGRDVLSL MLYGARVSLV
ISVVTIALGV PLGILLGLIA GYYGGKIDEA IMRVTDMFLA FPALVLALAL AATLPQRIRE
ALVENQAFAY AMAAIFGVKP DDAIHLAPLI SIFTALIIVW WPTYARVVRG MVLVEREKTY
VEAAKALGYS SWRIMTRHIL PNIMSPVVVL ITFDFASVNL LAAGLSFLGL GAQPPIVDWG
SLINMGGSRF PTAWWLVFFP GVAIFLTALG WNLLGDALRD VFDPKFRRRI EFRV