Gene Franean1_7300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7300 
Symbol 
ID5675601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8921025 
End bp8922443 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content77% 
IMG OID641246137 
Productglutamate--cysteine ligase GCS2 
Protein accessionYP_001511525 
Protein GI158319017 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG3572] Gamma-glutamylcysteine synthetase 
TIGRFAM ID[TIGR01436] glutamate--cysteine ligase, plant type
[TIGR03444] glutamate--cysteine ligase family protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.038587 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCCCG CCACCCCGGG CTCACCGGCT CCGGGCGCGG CCACGACGAA CCGGGTAGGC 
ATCGAGACCG AGTGGTTCGT CGTCGACGTC GCGAACCCGC GCCGGCCCGT CCCCGCGGAC
GAGACCGCCG CGGCGTTGGA GAGCGTCCTC GCCCCCGGTG ACCAGGGCCG GCAACCGGAC
CGAGGCCGGG AGAACCGAGG CCGGGAGAAC GGCCAGGCCC GCGACGGCGG CCCGCCGGGC
ACCGACTCGG CCCTGCGGCT TCCCGGCGGC AGCCGGCTCA CCTTCGAGCC GGGCGGCCAA
CTCGAGCTCT CGGGCCCTCC GCTGGACCTG ACCGCCGCCG TGGACGCGAT GCGCGCCGAC
CTCGCGCTCG TCCGCGGCGC CCTCGCCCGG CGCAGGCTCG GCCTGGTCGG CATGGGTGTC
GACCCGCTGC GGACGCCCGT CCGCCACACC ACCGCCTCCC GGTACGTGGC CATGGAGCAG
CACTTCCTGG CCCGGGACTG CGACGACGGC CGGACGATGA TGTGCTCGAC CGCCTCCGTC
CAGGTCAACC TGGACATCGG CACGGACACG GCGCAGGCCG TCGAGCGTTT CCGGCTCGCG
CACGCGCTCG AGCCGGTGCT GATCGCCATG TTCGCCGCGT CGCCGCTGGC CCACGGCCGG
CAGATCTCCT GGCAGTCGGG TCGCCAGGCG GTCTGGGCCG GTATCGACGC GAGCCGCACC
GGGCCGGTCC TGCCCGACCC GGCGGCGCCG GCGACGGCGT GGCTCGCCCA GGCGGGCCCG
GGCGGGCGCG GCGACGCCCC CGGCGCGCCG CCCACCGGAG ACGCCCAGCT GGCCTGGCTC
TGGGCCCGCT ACCTGCACGA CGCGGACCTC ATGATGGTCG GCGAGCCGGA CGGGCGGTAC
CAGCCGGTCC GGAACCGGAC GACCTTCGGG GACTGGATGG CCGGGGCAGG GCCGGTGGCA
CGCCCCCCGA CCGCCCAGGA CCTCGGCTGG CACGCCACGA CGCTGTTCCC GCCCGTCCGG
CCACGTGGCT GGCTGGAGCT GCGTTACCTG GATGCTCAGC CGTCCGACCT GTGGCCGGTG
GCCGTCGCCG TGCCGGCGGT GCTCCTCGAC GAGCCGGCCG CCGCCCGGGC CGCGCTCGCC
GCGTGCCTCC CGGTCGCCCG GCAGGGGCGG CTGGCGGCCC AGCTCGGCCT GCGGGACCCG
GCACTGCACC GTGCGGCGGT GCGGTGCGTC GACCTGGCCC TGGACACGCT CGCCCGGACG
GGCGCCGATC CCGGGCTGCG GGCCGCGGTG GAGGCCTTCG CCAACCGTTA CACGCGGCGG
GGGCGCTCCC CGGCCGACGA CCTGAACGCG CGTTTCGAGG CGCGCGGGCC GGCTGAGCTG
CTGCGCGAGG AGGCGTCCCA GTGCGTTCCG GTGCCTTGA
 
Protein sequence
MTPATPGSPA PGAATTNRVG IETEWFVVDV ANPRRPVPAD ETAAALESVL APGDQGRQPD 
RGRENRGREN GQARDGGPPG TDSALRLPGG SRLTFEPGGQ LELSGPPLDL TAAVDAMRAD
LALVRGALAR RRLGLVGMGV DPLRTPVRHT TASRYVAMEQ HFLARDCDDG RTMMCSTASV
QVNLDIGTDT AQAVERFRLA HALEPVLIAM FAASPLAHGR QISWQSGRQA VWAGIDASRT
GPVLPDPAAP ATAWLAQAGP GGRGDAPGAP PTGDAQLAWL WARYLHDADL MMVGEPDGRY
QPVRNRTTFG DWMAGAGPVA RPPTAQDLGW HATTLFPPVR PRGWLELRYL DAQPSDLWPV
AVAVPAVLLD EPAAARAALA ACLPVARQGR LAAQLGLRDP ALHRAAVRCV DLALDTLART
GADPGLRAAV EAFANRYTRR GRSPADDLNA RFEARGPAEL LREEASQCVP VP