Gene PICST_29031 details

Gene Information

Locus tag: PICST_29031
Symbol: CTP1
ID: 4851767
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2778041
End bp: 2779209
Gene Length: 1169 bp
Protein Length: 294 aa
Translation table:
GC content: 42%
IMG OID: 640393475
Product: citrate transport protein
Protein accession: XP_001387096
Protein GI: 126275540
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0488853
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.528605
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGCTC AAAAGGTATG TCCTACAGAG CTATTATGCT TTTGAAATTG ATAGAGAGAA 
GGGAAAGGGT CAAAAGTGAA TTAAAAGTGA AAAATTAAAG TGAAAATTGC TGAAGAATTC
AAATTGAAAA ATTAGTCAAA GGTGAAAAAT TAAATTAGAG AGAGTGAATT CTTACAAGGG
CACAACAGTG AAAGTTTTAA AAGTGTTAAT GTTCAACAGC CTTAGAACAA ATAAGATTGC
TGTCTAATTC TATCAATTCT ATCAAAGCTT CTTGTAATGC TTTCTACTAA CTTCTTCAGA
AAAAGGTTGA TCCTCTTAAG TCATTCATAG CTGGAGGTAC AGCTGGTGCT ATCGAGGGTG
TCATCACCTA CCCCTTCGAG TTTGCCAAAA CAAGACTACA GTTGATCGAC AAATCAGCCA
ACATATCTAG AAACCCATTG GTGTTGATCT TCAATGTGGC CAAGACTCAA GGCGTGGGCT
CCCTTTACGT AGGATGTCCA GCCTTTGTAG TAGGAAACAC CGTCAAGGCT TCTGTTCGTT
TTCTTGGGTT CGATTCCATC AAGGCGCTTT TGGCTGACAA GAACGGAAAG TTGTCTGGTC
CTAGAGGTGT GATAGCCGGA CTCGGTGCTG GTTTGCTTGA GTCTGTGGTG GCGGTGACTC
CTTTTGAAGC AATCAAAACA GCTTTGATTG ATGACAAACA ACTGGCTAAA CCAAAATACC
AAAACGGACT TGTTTCAGGC ACATTAAAGC TTTGTAGAGA TCTTGGATTC AAGGGCATTT
ATGCTGGTGT GGTACCAGTT TCGTTGAGGC AGGCTGCTAA CCAGGCTGTC AGATTGGGAT
CTTACAATGC CATCAAGACT ATGATTCAAC AAGCCAGTGG CTCTCGTCCG GACCAGCCTT
TAAGTTCAGT AGCTACCTTT GCTGTAGGTT CTTTTGCTGG AATTATCACT GTCTATACTA
CCATGCCTAT CGATACCGTC AAGACCAGAA TGCAAGCCTT AGGTGCAGAT AAGCTCTACA
CATCTACCGT CAACTGTTTC GCTAAGATCT TTAAGGAAGA AGGTCTCTTG ACGTTCTGGA
AGGGAGCCAC TCCACGTTTG GGCAGATTGG TGTTGAGTGG TGGTATTGTT TTTACCATCT
ACGAAAAGAT GTTGGTGATC ATGGGCTGA
 
Protein sequence
MSAQKKKVDP LKSFIAGGTA GAIEGVITYP FEFAKTRLQL IDKSANISRN PLVLIFNVAK 
TQGVGSLYVG CPAFVVGNTV KASVRFLGFD SIKALLADKN GKLSGPRGVI AGLGAGLLES
VVAVTPFEAI KTALIDDKQL AKPKYQNGLV SGTLKLCRDL GFKGIYAGVV PVSLRQAANQ
AVRLGSYNAI KTMIQQASGS RPDQPLSSVA TFAVGSFAGI ITVYTTMPID TVKTRMQALG
ADKLYTSTVN CFAKIFKEEG LLTFWKGATP RLGRLVLSGG IVFTIYEKML VIMG
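
The sequences above are displayed in blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal Python sketch of such a check, not part of the record itself. The helper names (`clean_sequence`, `span_length`, `gc_percent`) are hypothetical, and the sketch assumes that Start bp and End bp are 1-based, inclusive coordinates, which the record does not state explicitly but which agrees with the listed gene length of 1169 bp.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence shown in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive start and end."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(seq.count(base) for base in "GC") / len(seq)


# First displayed line of the gene sequence, copied from the record for illustration.
first_line = "ATGAGTGCTC AAAAGGTATG TCCTACAGAG CTATTATGCT TTTGAAATTG ATAGAGAGAA"

print(len(clean_sequence(first_line)))  # 60
print(span_length(2778041, 2779209))    # 1169
```

Passing the full cleaned gene sequence to `gc_percent` gives a value that can be compared against the GC content field. Because the record marks the gene as spliced, the gene length is not expected to equal three times the protein length.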