Gene Francci3_0832 details

Gene Information

Locus tag: Francci3_0832
Symbol: ureC
ID: 3905109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 971359
End bp: 973086
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 71%
IMG OID: 637878165
Product: urease subunit alpha
Protein accession: YP_479945
Protein GI: 86739545
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0804] Urea amidohydrolase (urease) alpha subunit
TIGRFAM ID: [TIGR01792] urease, alpha subunit
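
The coordinate and length fields in this record are mutually consistent, which a few lines of arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(971359, 973086)
print(length)                     # 1728
print(protein_length_aa(length))  # 575
```

Both values match the Gene Length and Protein Length fields listed in the record.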


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCGAC TCGACCGCTC CCGCTACGCG GCGCTGTACG GCCCCACGGT CGGCGACCGT 
ATCCGGCTCG CGGACACCGA CCTGTTCATC GAGGTGACGG AGGACCGCAG CCGCGGGCCG
GCGGGCACCG GGACCGGTGA CGAGGCCGTC TTCGGGGGCG GCAAGGTCAT CCGTGAGTCG
ATGGGGCAGT CCCTGGCAAC CCGCGCCGAG GGTGCGCCCG ACCTCGTCAT CACCGGCGCC
GTCGTTCTCG ACCACTGGGG TGTGGTCAAG GCCGACGTGG GGATCCGCGA TGGCCGGATC
GTCGCGCTGG GTAAGGCCGG CAACCCGGAC ACAATGGACG GCGTCCATCC CGAGCTGGTG
ATGGGGCCCG GCACCGAGAT CATCGCCGGT AACGGGAAGA TCCTCACCGC CGGCGCGGTG
GACTGCCACG TCCATCTGAT CTGCCCGCAG CAGGTTCCGG AGGCGCTGGG CGCCGGCATC
ACCACGCTGA TCGGCGGCGG CACCGGCCCG GCCGAGGGCA CGAAGGCGAC GACGGTGACG
CCCGGATCCT GGAACCTGGC GCGGATGCTG TCCGCGCTGG ACGACTGGCC GGTCAACGTT
GTCCTGCTCG GCAAGGGCAA CACGGTCAGC GACGAGTCGA TGTGGGAACA GCTGCGCGCC
GGGGCGGCCG GCTTCAAGCT GCACGAGGAC TGGGGGACGA CGCCGGCCGC AATCGACGCC
TGCCTGCGGG TCGCCGACGC CGCCGGCGTC CAGGTTGCGC TGCACTCCGA CACCCTGAAC
GAGGCCGGCT TCGTCGAGGA CACGCTGGCC GCCATCGCCG GCCGGGCGAT CCATGCGTAC
CACACCGAGG GGGCCGGCGG CGGGCATGCC CCGGACATCA TCACGGTCGC GGCCGCCGGC
AACGTGCTGC CGTCATCGAC GAATCCGACC CGCCCGCACA CCGTCAACAC CCTCGACGAG
CACCTCGACA TGCTGATGGT CTGTCACCAT CTCAACCCCA GCGTGCCCGA GGACCTGGCC
TTCGCCGAGA GCCGGATCCG GCCGTCGACG ATCGCCGCCG AGGACATCCT GCACGACCTG
GGCGCGATCT CGATGATCGG TTCGGACTCG CAGGCGATGG GCCGGATCGG CGAGGTCGTC
CTGCGTACCT GGCAGACCGC GCATGTGATG AAGCTCCGGC GCGGGTCCCT GGCCGGCGAC
GATCGGGCCG ACAACACCCG GGCCCGACGC TACATCGCCA AGTACACGAT CTGCCCGGCG
GTGGCGCACG GCCTGGACGC GGAGATCGGC TCGGTGGAGC CCGGCAAGCT CGCCGACCTG
GTGCTGTACG ACCCGGCGTT CTTCGGGGTG CGGCCGTCGC TGGTCCTCAA GGGCGGGTTC
ATCGCCTGGG CCGCGATGGG CGATGCGAAC GCCTCCATCC CGACCCCGCA GCCGGTGCTG
CCCCGGCCGA TGTGGGGAGC CGCCCGTGGT CCGGCCGCCG CGTCGTCGTT GATCTTCGTT
GCCCCGGCGG CGATCGAGGA CGGGCTGCCC GGACGGCTGG GGCTCGCCAC CCCGGTGGTG
CCGGTCGCGG ATGTGCGCCG CCGCGGCAAG GCGGATCTTC CCGAGAACAC CGCGACACCG
GACATCCGGG TGGACCCCGA CACGTTCACC GTCACGGTGG ACGGTGAGGC GATCGAGGCG
GATCCGGTGA GCGAACTCCC GATGACCCAG CGCTACTTCC TGTTCTGA
 
Protein sequence
MSRLDRSRYA ALYGPTVGDR IRLADTDLFI EVTEDRSRGP AGTGTGDEAV FGGGKVIRES 
MGQSLATRAE GAPDLVITGA VVLDHWGVVK ADVGIRDGRI VALGKAGNPD TMDGVHPELV
MGPGTEIIAG NGKILTAGAV DCHVHLICPQ QVPEALGAGI TTLIGGGTGP AEGTKATTVT
PGSWNLARML SALDDWPVNV VLLGKGNTVS DESMWEQLRA GAAGFKLHED WGTTPAAIDA
CLRVADAAGV QVALHSDTLN EAGFVEDTLA AIAGRAIHAY HTEGAGGGHA PDIITVAAAG
NVLPSSTNPT RPHTVNTLDE HLDMLMVCHH LNPSVPEDLA FAESRIRPST IAAEDILHDL
GAISMIGSDS QAMGRIGEVV LRTWQTAHVM KLRRGSLAGD DRADNTRARR YIAKYTICPA
VAHGLDAEIG SVEPGKLADL VLYDPAFFGV RPSLVLKGGF IAWAAMGDAN ASIPTPQPVL
PRPMWGAARG PAAASSLIFV APAAIEDGLP GRLGLATPVV PVADVRRRGK ADLPENTATP
DIRVDPDTFT VTVDGEAIEA DPVSELPMTQ RYFLF
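
The sequences above are printed in space-separated blocks of ten characters, so the layout whitespace must be removed before the sequences are measured or analysed. The sketch below shows one way to do that and to compute a GC fraction of the kind reported in the GC content field. The helper names `unblock` and `gc_fraction` are hypothetical, and the example runs on the first two blocks of the gene sequence only, not on the full record.

```python
def unblock(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay out a sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = unblock(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence, as printed in the record.
sample = "GTGAGCCGAC TCGACCGCTC"
print(len(unblock(sample)))  # 20
print(gc_fraction(sample))   # 0.7
```

Applying `unblock` to the complete gene and protein sequences should yield strings whose lengths agree with the Gene Length and Protein Length fields.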