Gene Francci3_3650 details

Gene Information

Locus tag: Francci3_3650
Symbol:
ID: 3905331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4360243
End bp: 4361493
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 74%
IMG OID: 637880973
Product: aminotransferase, class V
Protein accession: YP_482731
Protein GI: 86742331
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID:
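
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only and are not part of the database record.

```python
# Coordinates copied from the "Start bp" and "End bp" fields of this record.
start_bp = 4360243
end_bp = 4361493

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the terminal stop codon excluded.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1251 bp
print(protein_length, "aa")  # 416 aa
```

Both values agree with the "Gene Length" and "Protein Length" fields listed above.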


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCTACC TCGACCACGG GGCGACGACG CCGATGCGAC CGGAAGCGCT CGCCGCGTAC 
ACCGCGGTGC TCGCCGATAC CGGCAATGCC TCGTCGCTGC ATGCCAGTGG CCGCCGTGCC
CGCCGCATCG TCGAGGAGTC CCGCGAGACA CTCGCCGGAG TGCTGGGCGC CCGTCCGTCC
GACGTGGTGT TCACCGGAGG CGGTACCGAA AGTGACAACC TCGCCCTGAA GGGTCTGTAC
TGGTCACGCC GCCGGGCCGA ACCGGGCCGG CGGCGGGTGC TGGTCAGCGC CGTCGAGCAC
CGGGCGGTCC TGGACACCGT CGACTGGCTC GGCCGGGCGC AGGATGCCGA GGTCGAGCTG
CTCGCGGTCG ATGCCGCGGG AACCGTGCGG CCGGACACCC TCGCCGCCGC CCTCGAACGG
GATCCGGACT CGGTGGCCGT CGTGTCGGTG ATGTGGGCGA ACAACGAGGT CGGCACCGTT
CAGCCGATCG CCGAGCTCGC GACGATCGCG CACCGTCACG GCGTGCCCTT CCACACCGAT
GCGGTGCAGG CGTTCGGCCA GATCCCGATC GCCGTCACCG ACGAGGGCCC CGACGCCATC
ACGGTCAGCG CGCACAAGAT CGGGGGGCCG ATCGGTGTCG GTGCCCTGGT GGTGCGCCGG
GGATTGGCGA TGGAGCCGCT GACCCACGGC GGCGGTCAGG AGCGCGACAT CCGGTCCGGA
ACGTTGAACA CGGCCGGGGT GGCGGCGTTC GCGGCGGCCG CGGCGAGGGC ATGCGCCGAG
GCGCCGCAGG AGAGCGTCCG GCTGGCGGCC CTGCGTGACG ACTTGGTGCG CCGGGTCCGG
GCGGAGGTCC CGGAGGCGGT GCTCAACGGT GCCCCGCTGC TCGGGGACGG CGGCGGTGGG
GACGGCGGCG GTGGGGACGG CGGCGGTCCG GGACCGCACC GGCTGCCGGG AAACGCCCAT
CTGACCTTCC CCGGCTGCGA GGGAGACTCG CTGCTGATGC TGCTCGATGC CCGGGGGATC
GAGTGTTCCA CCGGCTCGGC CTGCTCCGCT GGAGTGGCGA GGCCGTCGCA CGTGTTGCTC
GCGATGGGAG TGGATGAGGC ACACGCCCGC GGATCGCTGC GGTTCTCCCT CGGGCACACC
TCACGGGCCT GCGACATCGA CGCGCTGGTC GCGGCGATCG GGCCGGTCGT CGAGCGGGCG
AGCCGCGCGG GGGCGCTGGC CGGCACGAGC GGCAGCATGA GCGGCACCTG A
 
Protein sequence
MTYLDHGATT PMRPEALAAY TAVLADTGNA SSLHASGRRA RRIVEESRET LAGVLGARPS 
DVVFTGGGTE SDNLALKGLY WSRRRAEPGR RRVLVSAVEH RAVLDTVDWL GRAQDAEVEL
LAVDAAGTVR PDTLAAALER DPDSVAVVSV MWANNEVGTV QPIAELATIA HRHGVPFHTD
AVQAFGQIPI AVTDEGPDAI TVSAHKIGGP IGVGALVVRR GLAMEPLTHG GGQERDIRSG
TLNTAGVAAF AAAAARACAE APQESVRLAA LRDDLVRRVR AEVPEAVLNG APLLGDGGGG
DGGGGDGGGP GPHRLPGNAH LTFPGCEGDS LLMLLDARGI ECSTGSACSA GVARPSHVLL
AMGVDEAHAR GSLRFSLGHT SRACDIDALV AAIGPVVERA SRAGALAGTS GSMSGT
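
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes the sequence is read in frame from its first base, uses the amino acid assignments of the standard genetic code (which translation table 11 shares), and reads an initiating GTG as methionine, as table 11 allows. The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers written for this example, not part of any database tool.

```python
# Codon table in NCBI order (bases T, C, A, G). The amino acid assignments
# are the same for translation table 11 as for the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiation codons listed for table 11; at position 1 each is read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "GTGACCTACC TCGACCACGG GGCGACGACG"
print(translate(first_blocks))  # MTYLDHGATT
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein listing and the "GC content" field.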