Gene Franean1_1709 details


Gene Information

Locus tag: Franean1_1709
Symbol:
ID: 5670111
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2041745
End bp: 2042938
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 73%
IMG OID: 641240627
Product: carbamoyl phosphate synthase small subunit
Protein accession: YP_001506053
Protein GI: 158313545
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0505] Carbamoylphosphate synthase small subunit
TIGRFAM ID: [TIGR01368] carbamoyl-phosphate synthase, small subunit
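
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record; the helper names are illustrative only.
START_BP = 2041745
END_BP = 2042938


def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1194
print(protein_length(length_bp))  # 397
```

Both results agree with the Gene Length (1194 bp) and Protein Length (397 aa) values listed in the record.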


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0706412
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0202078
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGGCT TGGACAGGCA GCCGGCTCCG CCGGAGCGGG AACAGGCCCG GCGCGGGCCG 
GGAGCGCCCC GCCGCGCGGT GCTCATGCTC GAGGACGGCC GGAGCTTCGC CGGGGACGCG
TTCGGCTCGG TCGGCGAGGC GTTCGGCGAG GCGGTCTTCT CCACCGGGAT GACCGGCTAC
CAGGAGACCC TCACCGACCC GTCGTTCCAC CGTCAGGTCG TGATCATGAC GGCGCCGCAC
ATCGGCAACA CCGGGGTGAA CGACACCGAC TACGAGTCCG ACCGCATCCA GGTGGCCGGC
TTCGTTGTGC GGGACCCGAG CCGGCTGGCG TCGAACTGGC GCGCCCAACG CACCCTGGAC
GACGAGCTCG AGAACGCCGG CGTGGTCGGG ATCAGCGGGG TCGACACCCG CGCGCTGACC
CGTCACCTGC GCGAGCGCGG CGCGATGCGG TGCGGGGTCA GCAGCACCGA CACCGATCTC
GACTCGCTGC TCGACCGGGT GCGCGAGTCG CCGGAGATGG TCGGCGCGGA CCTCGCCCCG
GAGGTCAGCA CGGACAAGCC CTACGTCGTC GAGGCGCGGT CGGGCCTGCC GCTCTTCACC
GTCGCCGCGC TGGACCTGGG CATCAAGCGG AACACCCCGC TCTCCATGGC AGCGCTGGGC
TGCGAGGTGC ACGTGCTGCC GGCCCGCAGC ACGGCCGCCG AGCTGCTGGC CCTCTCGCCC
GACGGGGTCT TCCTCTCGAA CGGCCCGGGT GACCCGGCCC GCGCGGACTA CGCGGTCGAG
ACGCTCACCG GGGTGCTGGA GGCGGGTGTC CCCGTCTTCG GGATCTGTTT CGGCAACCAG
GTGCTCGCAC GGGCCCTGGG CTTCGAGACG TACAAGCTGA CCTACGGCCA CCGCGGCGTG
AACCAGCCCG TGGCCGACAC CCGGACCGGC CGGATCGCGG TCACCAGCCA CAACCACGGC
TTCGCGGTGC GCGCGCCGCT GACCGGTACG ACCGACACCC CCTACGGGCG GGTCGAGGTG
AGCCACGTGG CGCTCAACGA CGACGTGGTG GAGGGCCTGA CCTGCCTGGA CGTGCCGGCG
TTCAGTGTCC AGTTCCATCC CGAGGCGGCG CCTGGCCCGC ACGACGCCCA GGGACTGTTC
GACCGGTTCT GCGGCCTGAT GGCGGCCGGC CGGCGGAAGC GGGGAGAAGG CTGA
 
Protein sequence
MTGLDRQPAP PEREQARRGP GAPRRAVLML EDGRSFAGDA FGSVGEAFGE AVFSTGMTGY 
QETLTDPSFH RQVVIMTAPH IGNTGVNDTD YESDRIQVAG FVVRDPSRLA SNWRAQRTLD
DELENAGVVG ISGVDTRALT RHLRERGAMR CGVSSTDTDL DSLLDRVRES PEMVGADLAP
EVSTDKPYVV EARSGLPLFT VAALDLGIKR NTPLSMAALG CEVHVLPARS TAAELLALSP
DGVFLSNGPG DPARADYAVE TLTGVLEAGV PVFGICFGNQ VLARALGFET YKLTYGHRGV
NQPVADTRTG RIAVTSHNHG FAVRAPLTGT TDTPYGRVEV SHVALNDDVV EGLTCLDVPA
FSVQFHPEAA PGPHDAQGLF DRFCGLMAAG RRKRGEG
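
The gene sequence begins with GTG while the protein begins with M, which is what one would expect for a bacterial gene translated with table 11, where an alternative start codon at the first position is read as methionine. The sketch below shows one way to reproduce the translation and the GC content from the sequence blocks above. It is a simplified illustration, not the database's own method: the function names are hypothetical, the codon table is built by hand rather than taken from a library, and the set of start codons is deliberately limited to ATG, GTG, and TTG (table 11 permits a few more).

```python
# Codon assignments in TCAG order (table 11 uses the same amino acid
# assignments as the standard code; it differs in the allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Simplifying assumption: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq, first_codon_is_start=True):
    """Translate a CDS; stops at the first stop codon, which is not emitted."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_codon_is_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGACCGGCT TGGACAGGCA GCCGGCTCCG"))  # MTGLDRQPAP
```

Passing the full gene sequence to `translate` and `gc_content` would allow the listed protein sequence and the 73% GC figure to be checked against the record.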