Gene Acry_2097 details

Gene Information

Locus tag: Acry_2097
Symbol:
ID: 5161881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidiphilium cryptum JF-5
Kingdom: Bacteria
Replicon accession: NC_009484
Strand:
Start bp: 2319702
End bp: 2322776
Gene Length: 3075 bp
Protein Length: 1024 aa
Translation table: 11
GC content: 68%
IMG OID: 640554019
Product: hypothetical protein
Protein accession: YP_001235215
Protein GI: 148261088
COG category: [S] Function unknown
COG ID: [COG4995] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
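
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive (which is consistent with the reported 3075 bp) and that the coding sequence includes one terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2319702, 2322776)
print(length)                     # 3075
print(protein_length_aa(length))  # 1024
```

Both values agree with the Gene Length and Protein Length fields of this record.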


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCGAA CGGTCCTGGT TCTGGCCGCC CTTGCCGGCC TGGCGCTGTT GAGCGCGTGC 
ACGCGGCCGC CGCCGGGCGC CTATGCGCAT GACACGGATT CCCCGCAATC GGCTAGCGGC
CGGCAACTCG CGCTGGGCAA AAATACCAGC GGCGAAACCT GCACCGCCAG CCGCACGACC
TCCGGCGGCG CATTGATCTA CTGCGGCAGT TGGCACCAGC CCAGCGCCGA AGTGCATGCG
GGTCCGGAGG CGAACCGGGC GAACCTGAAT GCAATCGTTA CGCGAGGAAA CTGGCGGGCC
GGTCTCGAAC AGCGCTACGC GTGCGGCAAT CCAAAGCCGA CAACGGTGCT CGGGCACTAC
CCTGGAGAGG TTCTGCTCTG CACGCAGCGG ATCGGCGGCT GGCCGCATGT CGCCCTGGCA
GCCTTGATCC GCGGCCATGT GTGGTTTGCC GATGGCGTGC AACCGGCTTT TCCAGCCATG
GAACGCGCCG TCGGCGTGAT CAGCGGCGCA GTGCCAGCCG CCGATGCCGG CGCGATGACC
ACTTCGGTTT CCGACAGCCA GCTTGCCGAC CGCCTCGCCG CCGCCGCCTT CACCTCGGGT
GATATCCACG AGTACGGCAA GTTGATGACG CTCGGCAACC GCTCCAACCA GGCCGAGGAT
TACCCTGCCG CCATCACCGC CTACCGGGCC GCGCTCGCGC TGCAGCAAAA GGCGCTTGGG
CCGGAAAATC CGAATACGGT GACGCCGATG CTGGATCTTG CCCTGAATCT CTCGGATCAG
GGCGATTATA GCCAGGCTGA CGCCCTGCTT GCGCAGGCGG CGAAGCTGGC TCCCCTGGCC
GTGGATCCCA CGGCGGTCGC GCGACTCGAC CATTATCGGG GTCTCGATCA ACTCAATCAG
GGTCATGATC GCCGTGCGCT GCGGCTGCTC TCGGCCGCAA ACCGGGATTA TGCATCGCTT
CTGCCCCGCT CCGTGTTGCG CCCCCACCCT GCCGCGACCG GGTCCGACGA TGGCTTCGGC
CTTTCGGCCG GAAGCCGGGG GGGATCGCTC CTCGCCGCGC AGTCGACCCT GTTGAGCCCG
ATCGGGCAGC AGGCCCTGCT CGGCGTGATC GAAACACTGC GGTATGAGGG CGTGGTGCTC
GATGCGGCCG GGCATCACAA GCAGGCCGCC GTGAAAATTG ACCGTGCATC GGCGATCGCT
TCGGCGAACG GGATTGCTCC TCCCGTGCTT CGCGCCAGGC TTGACCGGAG CACGGCCGCC
GTCGACACGG CGTTTGACAG GACCGCCTCG GCTGCCGCCA GGCTGGCGCA AGCCTCGTAT
GATTTCGAAC ATGCCCTGCC CGGTTCGCGC CCGGTCGCCG ACACGCACCT GCTGCGCGGC
GGCGTGCTCG ACCTCGCCGG TCAACCCGGA AAGGCCCTGG AGGCTTGCCG GGCCGGCATA
AAGCTTCTCG CCCGCCTGCG CCTCGGCACC TCTGCGGCCC TGGTTGCTCC GTGCCTCGAC
GCGGCCAATC GGGAAGCGGA GAAGAATCCT GCCCGGGCCG GGGTGCTTCG CGCCGAGATG
TTCGCGATGG CGGAACTCGC GCAAGGCAGC ACGACCAGCC GTGAAATCGC CGAGTCCGCC
GCGCGGCTGG CCGCGGATTC AAAGAGCCCC AAAGTCGCCG CCGCGATCCG CGCCCATCAG
GATGCCGTCA TCGCCCTGTC GCGCCTCTAC CGGGAGCGTG ACGGCATCGC CCATGAACGG
CAGGCAACGC CGGCCGCCAT CAACGCAATC GACATGAAGA TCGCGGCGGC AACCCGGCGG
CTCGCCGAAG CGGACGAGTC CGTCGAGGCG GCGGCACCGA ATTTCGGCCA GCTGGTCCAG
CAGGTCGTGC CCGCCGGCAC CGTGCTCGAC CGGCTGAGGC CTGGCGAAGC CTTTCTCGAT
ATCATGCCGG CGCCGGACGG CACGTGGACG TTCCTGTTGC ACGACGGGCG GATCGCCGTC
GCGCACACGA AGGTCGACGA GACGCGCATG ACCGCACTGG TCCGCAAAGT GCGCCAAAGC
GTCGTGCCGA CGCAGGCGGG ATTGCCGACC TTCGCGATGA CCTCTGCCGA AGCGATCTAT
CGCGCGACCC TGGCCCCCTT CGCGACCGAC CTCGCGAAGA CCACGGCGAT GGTGGTTTCA
CCGGCCGGGG CGCTGCTGTC GCTGCCATTC GCGCTGCTGC CCACCGAGCC GGTCGCCCCC
GGCACGCCGC TGGCCAAGGT GCCCTGGCTG ATCAGGAAGA TGACGCTGGT CTACGTTCCC
GCGGCCGCGA ATTTCGTCTC GCTGCGCGCC ATCGCCGGCA CGTCGCCCGC CAAGCAGCCA
TGGTTTGGCT TCGGGGACTT CAAGAACACG AGCCTGGCCC AGGCCGAGGC CACGTTCAGT
GGCCCCAGTT GCGGCGACAG CGCCCGGCTC TTCGCCGGCC TGCCCCACCT GCCGTATGCG
AAGCTCGAAC TTGACGCCGC CCGCGCGATC TTCAAGGCAC CCGCATCGGA CGAGCTGCTG
GGGGCGGCCT TCACGGTCCC GAATGTCGAG CACGCCGACC TGAAGCAATA CCGGATCCTG
CATTTCGCGA CCCACGCGCT GCTGCCCTCC GAACTGCCCT GCGCTCATGA ACCGGCGATC
GTCACCTCGC CGCCGCCCGG CGCCAGGTCC GCGGCGAACT CGATGCTGAC GACCTCCGAC
ATCACCAATC TCAAACTGAA CGCCGACCTC GTGATCCTGT CGGCATGCAA TACGGGCGGG
GGCGACGGAA AGGCCGGCGG TGAAGCGCTT TCCGGCCTTG CCCGCGCGTT CTTCTTCGCC
GGCGCCCGCG CGCTGATGGT CACGCAATGG TCGGTGAACG ACCAGGTCAG CTCCTATCTG
GTCGCAACCA CGCTCACCCA TCTGGCCTCA TCGACGGGCG AGGGGGCGGC CGCGAGCCTG
CGGAGCGCCC AGCTCGACCT GATCAGGGGC GCCGCGTCCG GCACCCTGCC GGCCAAGCTC
GCCGATCCGT TCTTCTGGGC GCCTTTCGTG GTCATCGGCG ACGGTGGACA GGGCACGCGC
AATCTGGCCA AGTAG
 
Protein sequence
MRRTVLVLAA LAGLALLSAC TRPPPGAYAH DTDSPQSASG RQLALGKNTS GETCTASRTT 
SGGALIYCGS WHQPSAEVHA GPEANRANLN AIVTRGNWRA GLEQRYACGN PKPTTVLGHY
PGEVLLCTQR IGGWPHVALA ALIRGHVWFA DGVQPAFPAM ERAVGVISGA VPAADAGAMT
TSVSDSQLAD RLAAAAFTSG DIHEYGKLMT LGNRSNQAED YPAAITAYRA ALALQQKALG
PENPNTVTPM LDLALNLSDQ GDYSQADALL AQAAKLAPLA VDPTAVARLD HYRGLDQLNQ
GHDRRALRLL SAANRDYASL LPRSVLRPHP AATGSDDGFG LSAGSRGGSL LAAQSTLLSP
IGQQALLGVI ETLRYEGVVL DAAGHHKQAA VKIDRASAIA SANGIAPPVL RARLDRSTAA
VDTAFDRTAS AAARLAQASY DFEHALPGSR PVADTHLLRG GVLDLAGQPG KALEACRAGI
KLLARLRLGT SAALVAPCLD AANREAEKNP ARAGVLRAEM FAMAELAQGS TTSREIAESA
ARLAADSKSP KVAAAIRAHQ DAVIALSRLY RERDGIAHER QATPAAINAI DMKIAAATRR
LAEADESVEA AAPNFGQLVQ QVVPAGTVLD RLRPGEAFLD IMPAPDGTWT FLLHDGRIAV
AHTKVDETRM TALVRKVRQS VVPTQAGLPT FAMTSAEAIY RATLAPFATD LAKTTAMVVS
PAGALLSLPF ALLPTEPVAP GTPLAKVPWL IRKMTLVYVP AAANFVSLRA IAGTSPAKQP
WFGFGDFKNT SLAQAEATFS GPSCGDSARL FAGLPHLPYA KLELDAARAI FKAPASDELL
GAAFTVPNVE HADLKQYRIL HFATHALLPS ELPCAHEPAI VTSPPPGARS AANSMLTTSD
ITNLKLNADL VILSACNTGG GDGKAGGEAL SGLARAFFFA GARALMVTQW SVNDQVSSYL
VATTLTHLAS STGEGAAASL RSAQLDLIRG AASGTLPAKL ADPFFWAPFV VIGDGGQGTR
NLAK
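
The sequence blocks above are laid out in groups of 10 characters, 60 per line, so they need to be cleaned before any computation. The sketch below shows one way to strip the layout whitespace, compute a GC fraction, and check that a gene sequence and its protein sequence have consistent lengths. The helper names are hypothetical, and the check assumes an unspliced CDS that ends in a single stop codon. The example uses only the final line of each block.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


def lengths_consistent(dna: str, protein: str, has_stop_codon: bool = True) -> bool:
    """True if the DNA length equals 3 bases per residue (plus a stop codon)."""
    expected = 3 * (len(protein) + (1 if has_stop_codon else 0))
    return len(dna) == expected


# Last line of the gene sequence and last line of the protein sequence.
dna_tail = clean_sequence("AATCTGGCCA AGTAG")
print(len(dna_tail))                         # 15
print(dna_tail.endswith("TAG"))              # True
print(lengths_consistent(dna_tail, "NLAK"))  # True
```

Applied to the full blocks, the same functions can be used to compare the computed values with the Gene Length, Protein Length, and GC content fields.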