Gene Bind_3155 details


Gene Information

Locus tag: Bind_3155
Symbol:
ID: 6201454
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 3599181
End bp: 3600428
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 64%
IMG OID: 641707103
Product: squalene-associated FAD-dependent desaturase
Protein accession: YP_001834205
Protein GI: 182680059
COG category: [S] Function unknown
COG ID: [COG3349] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03467] squalene-associated FAD-dependent desaturase
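
The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length, the arithmetic can be checked as follows. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(3599181, 3600428)
print(length)                     # 1248
print(protein_length_aa(length))  # 415
```

Both values agree with the Gene Length and Protein Length fields of this record.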


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATCGTC CCATCGTTCA TATCGTCGGC GCGGGTCTAG CCGGCCTTGC CGCCGCTATC 
GCGCTTGCCG ACGGGCGCCG TGAAATCATC CTCTACGAGG CCGCAAGACA GGCGGGAGGC
CGTTGCCGCT CCTATTTCGA CACAGCCCTT GGCATGGTGA TCGATAATGG CAATCATTTG
CTGCTTTCAG GCAATCACGA GGCGCGCCGT TTCCTGCGCA CCATTGGCTC GGAGGCGAAT
CTTCAAGGGC CCACGGAAGC CGATTTCCCA TTTTTCGACC TCGCGACGGG CGAACATTGG
CGCGTGCGGC CCAATGCCGG CCCCCTGCCC TGGTGGGTTT TTTCGAAGAA TCGCCGCGTG
CCGGGCACAA AACCGCTCGA TTATCTCGGC CTCGCCAAAT TGCTCCTGAC GCGCGAGGAT
AGAAAAATCG GAGAGATTCT GTCCGACCAG GGACCGCTCT ATCAAAAACT TTGGCGGCCA
TTCCTGCTCG CCGCCCTCAA TCTCGAGCCG CCCGAAGGCT CCTCCGCGCT CGCCGCCTCT
GTCATCCGCG AGACACTCGC CAAGGGTGGG CAGGCCTGCC GCCCCATGAT CGCCCATCCG
ACACTTTCCG CCGCCTTCAT CGAGCCCGCG CTCGCCTGGC TTTCACAACG GGGCGCCGAA
ATTTGCCTCG ACCATCGGCT ACGGACGATC CGTTTCGAGG GGGACCGCGT CGCGGGCCTC
GAATTTGGTG ATGCGCGCAT GAGCTTGCGG CCGCAGGACA CCCTGGTCCT TGCCGTACCC
GCCCCCGTGG CGCAGGAACT CGTGCCAGGA CTTCAAGCGC CACAGCGCTT CACCGCCATC
GTCAATGCCC ATTTCAAGAT CACGCCGCCG GCAGGATTCC CGCCCATTCT CGGCCTCGTC
AACAGTGTCA GCGAATGGCT CTTCGCCTTC CCCGAACGGC TCGCGGTGAC GATCAGCGGC
GCCGACCATC TGCTCGACGA GCCGCGCGAG GTCCTCGCCG CAAAAATCTG GGCCGAGGTG
GCAAAAGTGA CACAGATTGC GGCGCCGCTT CCCGCATGGC AAATTCTCAA GGAAAAGCGG
GCGACCTTCG CCGCGACGCC CGAAGAAAAC GCGCGCCGCC CGGGGGCACG CACCGCATTT
GCCAATCTGG TGCTCGCCGG CGACTGGACC GCGACCGGAT TGCCCGCAAC GATCGAGGGT
GCAATACGCT CCGGCAATTT CGCGGCGCGC GCGCTTCTCG CGAACTGA
 
Protein sequence
MDRPIVHIVG AGLAGLAAAI ALADGRREII LYEAARQAGG RCRSYFDTAL GMVIDNGNHL 
LLSGNHEARR FLRTIGSEAN LQGPTEADFP FFDLATGEHW RVRPNAGPLP WWVFSKNRRV
PGTKPLDYLG LAKLLLTRED RKIGEILSDQ GPLYQKLWRP FLLAALNLEP PEGSSALAAS
VIRETLAKGG QACRPMIAHP TLSAAFIEPA LAWLSQRGAE ICLDHRLRTI RFEGDRVAGL
EFGDARMSLR PQDTLVLAVP APVAQELVPG LQAPQRFTAI VNAHFKITPP AGFPPILGLV
NSVSEWLFAF PERLAVTISG ADHLLDEPRE VLAAKIWAEV AKVTQIAAPL PAWQILKEKR
ATFAATPEEN ARRPGARTAF ANLVLAGDWT ATGLPATIEG AIRSGNFAAR ALLAN
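
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is one way to do that in plain Python, under two assumptions: the codon-to-amino-acid assignments of table 11 are the same as those of the standard code, and the first codon is read as Met because it acts as the initiator (here GTG). The helper name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in the compact NCBI layout: bases ordered T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna: str) -> str:
    """Translate a CDS given 5'->3', ignoring whitespace between sequence blocks."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        # Assumption: the first codon is a valid initiator and is read as Met.
        aa = "M" if i == 0 else CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGGATCGTC CCATCGTTCA TATCGTCGGC GCGGGTCTAG CCGGCCTTGC CGCCGCTATC"
print(translate_cds(first_line))  # MDRPIVHIVGAGLAGLAAAI
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same function should yield the complete protein, terminating at the final TGA codon.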