Gene Sde_3968 details

Gene Information

Locus tag: Sde_3968
Symbol:
ID: 3967272
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 5000366
End bp: 5001907
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 47%
IMG OID: 637923065
Product: F0F1 ATP synthase subunit alpha
Protein accession: YP_529435
Protein GI: 90023608
COG category: [C] Energy production and conversion
COG ID: [COG0056] F0F1-type ATP synthase, alpha subunit
TIGRFAM ID: [TIGR00962] proton translocating ATP synthase, F1 alpha subunit
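
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 5000366
end_bp = 5001907
gene_length_bp = 1542
protein_length_aa = 513

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
codons = span_bp // 3
residues = codons - 1

print(span_bp, span_bp % 3, residues)  # 1542 0 513
```

Under these assumptions the span equals the listed gene length, is a whole number of codons, and yields the listed protein length.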


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00212226
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.363418
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACAGC TGAATCCTTC CGAAATTAGC GAAATTATTA AGCAACGGAT AGACAACCTA 
GCGGTTTCTA CCGAAGCGCA AAACGAAGGC ACCGTGGTTT CTGTAACCGA CGGTATTATT
CGTATCCACG GTCTTGCCGA TGTTATGTAC GGTGAGATGA TTGAATTTGA AGGCGGTGTT
TACGGTATTG CGCTTAACTT AGAGCGCGAT TCAGTAGGTG CAGTTGTATT GGGTGATTAC
CAAGGCGTTG CCGAAGGTCA GTCCTGTAAG TGTACAGGTC GTATCTTGGA AGTGCCTGTT
GGTGAAGAGC TTTGTGGTCG CGTTGTCGAT GCACTAGGTA ACCCAATCGA CGGTAAAGGC
CCTATTAACG CCAAGAAAAC TGACGCAATC GAAAAAATTG CACCGGGCGT AATTGCTCGT
CAGTCCGTTG ATCAGCCAGT GCAAATTGGT TTGAAAGCAG TTGATACCAT GGTGCCAATC
GGCCGTGGTC AGCGCGAATT GATTATTGGT GACCGCCAAA CAGGTAAAAC AGCTGTTGCT
GTCGATGCCA TCATTAACCA AAAAGGTACA GGTATTAAAT GTATCTATGT GGCTATTGGT
CAAAAAGCAT CGTCTGTTGC ATCTGTTGTG CGTAAATTAG AAGAGCACGG CGCAATGGAT
CACACCATTG TTGTTGCTGC TACCGCTTCT GACCCAGCAT CTATGCAGTT CTTGGCTCCT
TTTGCGGGCT GTACTATGGG TGAATACTTC CGCGATCGCG GCGAAGACGC ACTTATTATT
TATGATGATT TGACTAAACA AGCTTGGGCT TACCGTCAAA TTTCTTTGTT ACTACGTCGT
CCACCAGGCC GTGAAGCCTA CCCAGGTGAC GTTTTCTACT TGCACTCACG CTTACTTGAG
CGCGCTGCAC GTGTAAACGC TGACTACGTA GAGCAACTCA CCAACGGTGA AGTGAAAGGT
AAAACCGGTT CTTTAACCGC ATTGCCAATC ATCGAAACTC AAGCTGGTGA CGTTTCTGCA
TTCGTACCTA CCAACGTAAT TTCTATTACC GATGGTCAGA TCTTCCTTGA AACAGACCTA
TTCAACGCCG GCATCCGTCC TGCAATGAAC GCAGGTATCT CGGTTTCTCG TGTTGGTGGT
TCTGCTCAGA CTAAAGTAAT TAAGAAATTG TCTGGTGGTA TTCGTACCGC ACTTGCGCAG
TACCGCGAAT TGGCGGCTTT CTCTCAGTTT GCATCCGATT TGGACGAAGC AACCAAAGCT
CAGTTAGAGC ACGGTGAGCG CGTAACCGAA TTGATGAAGC AAAAGCAGTA CTCTCCACAA
AGCGTGGGTG AAATGGCTGT TGTTGTTTAC GCTGCGAACG AAGGCTACTT GAAAGACGTA
GAAGTTTCCA AAATTGGTGA TTTCGAATCT GCATTGCTGT CTTACATGAA CAGCTCGCAC
GCCGATTTGA TGAACACCAT GAACGGCGGA AGCTACAGCG ATGAAATCGC TGGGCAACTG
AAGTCTGCTT TGGATACCTT TAAAGCCACG CAAACTTGGT AA
 
Protein sequence
MQQLNPSEIS EIIKQRIDNL AVSTEAQNEG TVVSVTDGII RIHGLADVMY GEMIEFEGGV 
YGIALNLERD SVGAVVLGDY QGVAEGQSCK CTGRILEVPV GEELCGRVVD ALGNPIDGKG
PINAKKTDAI EKIAPGVIAR QSVDQPVQIG LKAVDTMVPI GRGQRELIIG DRQTGKTAVA
VDAIINQKGT GIKCIYVAIG QKASSVASVV RKLEEHGAMD HTIVVAATAS DPASMQFLAP
FAGCTMGEYF RDRGEDALII YDDLTKQAWA YRQISLLLRR PPGREAYPGD VFYLHSRLLE
RAARVNADYV EQLTNGEVKG KTGSLTALPI IETQAGDVSA FVPTNVISIT DGQIFLETDL
FNAGIRPAMN AGISVSRVGG SAQTKVIKKL SGGIRTALAQ YRELAAFSQF ASDLDEATKA
QLEHGERVTE LMKQKQYSPQ SVGEMAVVVY AANEGYLKDV EVSKIGDFES ALLSYMNSSH
ADLMNTMNGG SYSDEIAGQL KSALDTFKAT QTW
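
As a spot check that the two sequences agree, the first and last blocks of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch in plain Python, not a replacement for a sequence-analysis library. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons); `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 60 bases of the gene sequence, copied from the record.
first_block = "ATGCAACAGC TGAATCCTTC CGAAATTAGC GAAATTATTA AGCAACGGAT AGACAACCTA"
# Last 42 bases; they follow 1500 bases, so they start on a codon boundary.
last_block = "AAGTCTGCTT TGGATACCTT TAAAGCCACG CAAACTTGGT AA"

print(translate(first_block))  # MQQLNPSEISEIIKQRIDNL
print(translate(last_block))   # KSALDTFKATQTW*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final 13 residues followed by the stop codon.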